AI in palynological taxonomy
Introduction
Artificial Intelligence is becoming an interesting avenue of research in geology. Recently the International Union of Geological Sciences (IUGS) Deep-time Digital Earth (DDE) team, and partners Alibaba Cloud, have begun to develop a large language model for geology known as GeoGPT. The team also wanted to know if large language model methods could augment a traditional taxonomic key. Could a palynologist sitting by their microscope get some help from a Large Language Model (LLM) in determining a species? It turns out that palynology very much lends itself to developing a professional LLM that might be very useful in teaching taxonomy to apprentice palynologists, or to professionals in environments that require expertise in many areas of palynology. It could also have particular use in Global South countries where access to reference materials may be difficult.
LLM-aided taxonomy is text based, and will probably be delivered through a series of structured questions and answers that converge on a determination, rather than being based on image recognition. There are already some quite sophisticated image-based methods of identification in palynology (e.g. Mahmood et al., 2023; Chronosurveys, 2024; Barnes et al., 2023). However a text-based system could have some big advantages over an image-recognition system.
Taxonomic keys
In palaeontology, an identification key or taxonomic key can be in the form of a printed series of notes and instructions that help you identify a palynomorph (or any fossil), perhaps at genus or species level. Identification keys are also used in many other scientific and technical fields to identify or diagnose diseases, minerals, or archaeological artifacts.
Most keys provide a fixed sequence of identification steps, each with multiple alternatives, the choice of which determines the next step (Fig. 1). When the key stage has two alternatives, the stage is dichotomous, when there are more choices, the stage in the key is polytomous.
At each step, the user has to make a choice about the characters of the fossil being identified. In palynology there are a range of possible starting points and so the person using the key has to know something about the morphology of the range of palynomorphs that might be encountered.
Palynological taxonomy
Discussions with specialists in LLMs have shown that palynology and other newer branches of the science of palaeontology may have some special advantages for the development of LLMs because they can provide ‘learning materials’ that are clearly and consistently structured. The hierarchical structure of descriptions of palynological species also provide signposts for the LLM in developing the order of questions.
So what might an LLM-aided taxonomic key look like? Well perhaps the system might ask the palynologist a series of questions like: what is the shape of the palynomorph and what is its size? In other words the LLM key would ask a series of relevant intelligent questions in a particular order, prompting the user to make sensible taxonomic decisions getting closer and closer to a determination. This imitates the traditional key, only a computer is asking the questions. The value of this method over identification using images (e.g. Mahmood et al. 2023) is that the taxonomist is guided through a series of steps that help them understand how taxonomy works. In other words, it is not a ‘black box’.
Imagine if the LLM key was helping you identify a car that you had seen in a car park. It might ask, what’s the badge on the car? what’s the shape of the headlight, how many headlights does the car have? What is the shape of the radiator grill? After a series of relevant intelligent questions and ‘taxonomic’ decisions you might arrive at an answer at ‘species level’ – so the car is perhaps identified as a Tesla Model S. The same might be true of the palynological taxonomy LLM.
As previously suggested, palynology lends itself to an LLM because the ‘learning materials’ are clearly and consistently structured. What we mean by this is that palynology, being a relatively young branch of palaeontology, is ‘written down’ in a fairly consistent way, i.e. the descriptions and diagnoses of its species are generally consistent in form. This is because most of the descriptions and diagnoses have been written in the last few decades, rather than over a century or more, as is the case with older branches of palaeontology. Mostly nomenclature has remained consistent because descriptive terms mean the same as they have always meant - and measurements are in consistent units. Another advantage is that palynological descriptions and ‘diagnoses’ are remarkably similar in structure and content. Mostly descriptions follow a particular pattern. For example for a spore, the description may begin ‘spores, radial, trilete; amb circular; laesurae distinct, with narrow lips’. For a pollen grain (in this case a monosaccate pollen grain), it might begin ‘pollen, monosaccate, radially symmetrical, trilete; amb circular’. The description goes on to become more detailed. But in a way, the description is offering a hierarchy, in the sense that it begins with the big elements (things like shape and symmetry), and then delves deeper. So as ‘learning material’ for the LLM, it has a built-in step by step process.
Looking deeper, you can analyse each species description in terms of ‘keys’ and ‘values’. Keys are character types, and values are the descriptors, the adjectives. Figs 2 and 3 show a set of ‘keys’ and ‘values’ for a species of saccate pollen and a spore. The keys are signposts for parts of the decision tree, and the values are possible ‘answers’ (like in a drop down menu). Some keys will have a limited number of ‘values’; other keys will have values that are more variable and descriptive with a very large number of possible ‘answers’, and perhaps will need to accommodate continuous variation or subtle shades of difference. For these keys it would be very difficult and restrictive to apply a small number of value 'choices'.
Taxonomic keys are not perfect because they depend on the knowledge or judgment of the person that designed them. They also depend on the quality of the descriptions and diagnoses, but once an appropriate description structure (as in Figs. 2 and 3) is created, the LLM will do the rest of the work assuming that it is given plenty of high quality material to ‘learn’ with. In the cases above, the descriptions are from Mike Stephenson’s PhD thesis – and these are not ‘official’ diagnoses or descriptions (i.e. those of the original authors), and they could be flawed. Much more material will be needed (perhaps descriptions of thousands of species), from other PhD theses and databases, and eventually from the peer reviewed literature, so permissions and approvals will have to be gained. Even with these shortcomings, taxonomic keys could be used in teaching and learning, helping students to quickly gain a grasp of the basics of taxonomy. They could be particularly useful in the Global South where even now information is hard to come by (e.g. Nobes and Harris 2019).
Could an LLM-guided taxonomic key be even better than a manual key? Whatever the answer to that question, a traditional or LLM key should not replace the need for students to use taxonomic literature (for example published diagnoses and descriptions). But the use of a well-designed LLM key would have considerable pedagogic value. In a commercial environment, for example in companies that use stratigraphic palynologists, LLM-guided keys could help in standardising taxonomic procedures making taxonomy more reliable and therefore correlation and stratigraphy more reliable.
Research is continuing and it is hoped that a pilot system will be released soon for testing.
References
Barnes, C.M., Power, A.L., Barber, D.G., Tennant, R.K., Jones, R.T., Lee, G.R., Hatton, J., Elliott, A., Zaragoza-Castells, J., Haley, S.M., Summers, H.D., Doan, M., Carpenter, A.E., Rees, P. and Love, J., 2023, Deductive automated pollen classification in environmental samples via exploratory deep learning and imaging flow cytometry. New Phytol, v. 240, p. 1305-1326. https://doi.org/10.1111/nph.19186
Chronosurveys 2024 https://www.chronosurveys.com/research/
Mahmood, T., Choi, J., and Ryoung Park K., 2023. Artificial intelligence-based classification of pollen grains using attention-guided pollen features aggregation network, Journal of King Saud University - Computer and Information Sciences, v. 35, p. 740-756.
Nobes, A., and Harris, S. 2019. Open Access in developing countries – attitudes and experiences of researchers. https://doi.org/10.5281/zenodo.3464868