Using LLMs for Descriptions of Plants, Insects, and Fungi

Background
Kindwise’s APIs initially relied on Wikipedia to provide more information on plants, insects, and fungi in our knowledge base. This information is important for Kindwise clients who want to display general descriptions in their products. Wikipedia entries, while multilingual, vary significantly in coverage across languages: English entries cover up to 82.7% of the entities in our insect.id database, while coverage in less common languages drops to only a few per cent.
To fill these gaps, we tested several public large language models (LLMs) for their accuracy, language correctness, and relevance in generating information. Here, we share insights from our experiments, which may help others considering LLMs for similar purposes.
Models Tested
We tested the following LLMs:
- Mistral Pixtral 12B
- GPT-4o mini
- GPT-4o
We used temperature=0 and top_p=1 to prioritize factual accuracy and to get consistent results.
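As a point of reference, here is a minimal sketch of how these sampling settings can be applied when calling the GPT models through the OpenAI Python SDK (the Mistral model would be called through its own client); the helper function is illustrative rather than our production code.

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

def generate_description(prompt: str, model: str = "gpt-4o") -> str:
    """Generate one description with deterministic-leaning sampling settings."""
    response = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
        temperature=0,  # prioritize factual accuracy
        top_p=1,        # keep results consistent across runs
    )
    return response.choices[0].message.content
```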
Prompt Design
Our base prompt:
Please write a concise, factually accurate Wikipedia-style paragraph about [plant name] (family [family name]) in [language]. Use scientific names only, avoiding any common names or colloquial descriptions. Omit unnecessary details like awards. Do not mention flower arrangement or excessive morphological details.
We experimented with English and with Czech, a less common language (and our native one). We included taxonomic placement (e.g., family name) to reduce confusion. The prompt was modified slightly based on organism type and whether the entity was common or rare.
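For illustration, the base prompt can be parameterized with a simple template; the function and variable names below are hypothetical and only show how the placeholders would be filled per entity and language, with the result then passed to the generation call sketched above.

```python
BASE_PROMPT = (
    "Please write a concise, factually accurate Wikipedia-style paragraph about "
    "{name} (family {family}) in {language}. Use scientific names only, avoiding "
    "any common names or colloquial descriptions. Omit unnecessary details like "
    "awards. Do not mention flower arrangement or excessive morphological details."
)

def build_prompt(name: str, family: str, language: str) -> str:
    """Fill the base prompt for one entity; real prompts also vary by organism type and rarity."""
    return BASE_PROMPT.format(name=name, family=family, language=language)

prompt = build_prompt("Combretum coccineum", "Combretaceae", "Czech")
```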
Findings
1. GPT-4o vs GPT-4o mini
- Not surprisingly, GPT-4o clearly outperformed GPT-4o mini, especially in grammar.
2. Performance on common vs. rare organisms
- Models generally performed well on common organisms but hallucinated heavily for rare species. They also padded descriptions with generic statements of little informational value, such as "this species contributes to biodiversity," reflecting the uneven availability of information across species.
- Allowing the model to decide what information to include (instead of requiring specific information like distribution or uses) reduced error rates.
3. Language-specific challenges
- Descriptions in less common languages often contained inaccuracies. For example, here’s the output for Combretum coccineum:

English (GPT-4o mini):
Combretum coccineum is a species in the family Combretaceae, native to tropical regions of Africa. It is characterized by its climbing habit and can reach significant heights, often utilizing surrounding vegetation for support. The leaves are opposite, simple, and entire, with a glossy appearance. The species is known for its distinctive red to orange inflorescences, which attract various pollinators. Combretum coccineum is often found in disturbed areas, forest edges, and along riverbanks, thriving in well-drained soils and requiring full sunlight for optimal growth.
Czech (GPT-4o mini):
Combretum coccineum je druh rostliny patřící do čeledi Combretaceae. Tento druh je rozšířen v tropických oblastech Afriky, zejména v savanách a lesních okrajích. Combretum coccineum je vytrvalá dřevina, která dosahuje výšky až 10 metrů. Listy jsou jednoduché, střídavé a mají celokrajné okraje. Plody jsou dřevnaté tobolky, které obsahují semena s křídly, což usnadňuje jejich šíření větrem.
(English translation: Combretum coccineum is a plant species belonging to the family Combretaceae. The species is distributed in tropical regions of Africa, especially in savannas and at forest edges. Combretum coccineum is a perennial woody plant reaching heights of up to 10 metres. The leaves are simple, alternate, and have entire margins. The fruits are woody capsules containing winged seeds, which facilitates their dispersal by wind.)
- The Czech output wrongly states that the leaves are alternate, and it describes the fruits as woody capsules (“dřevnaté tobolky”) when they are actually achenes (“nažky”).
- Smaller models (Pixtral 12B and GPT-4o mini) frequently made language errors in Czech, whereas GPT-4o made almost no mistakes.
The performance gap we observed between English and Czech can likely be extrapolated to other less common languages. The gap is narrower for widely spoken languages such as German, French, or Spanish, and wider for languages with limited online resources.
Implementation in the API
Finally, we decided to proceed with generating descriptions in 28 major languages using GPT-4o, despite its higher price. For less common organisms, where the error rate was highest, we improved accuracy by feeding the English Wikipedia description into the prompt, although the model tended to translate it literally (even when asked not to). Descriptions of the rarest organisms were omitted in less common languages, as these produced the highest error rates. Here is the description coverage for plant.id and insect.id.
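The grounding step for rare organisms can be sketched as follows; the instruction wording and the function name are illustrative assumptions, not our exact production prompt.

```python
GROUNDING_SUFFIX = (
    "\n\nUse the following English reference text as the factual basis, but write an "
    "original paragraph in {language} rather than a literal translation:\n{reference}"
)

def build_grounded_prompt(base_prompt: str, language: str, english_description: str) -> str:
    """Append the English Wikipedia text as factual grounding for a rare species."""
    return base_prompt + GROUNDING_SUFFIX.format(
        language=language, reference=english_description
    )
```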
The error rate is relatively low. In a sample of 30 plants and 30 insects, we found only one factually incorrect statement in both plant.id and insect.id, one partially incorrect statement in insect.id, and two grammatically inaccurate cases in both plant.id and insect.id. You can see the specific cases highlighted in this spreadsheet.
For the implementation (plant.id, insect.id), use the description_gpt parameter. For combined descriptions from Wikipedia and GPT-generated descriptions, use the description_all parameter.
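Below is a hedged sketch of requesting these descriptions; it assumes the plant.id v3 identification endpoint, its Api-Key header, and a details query parameter that accepts these keys, so check the Kindwise API documentation for the exact request shape.

```python
import base64
import requests

API_KEY = "your-api-key"  # placeholder

def identify_with_descriptions(image_path: str) -> dict:
    """Identify a plant and request GPT-generated and combined descriptions (assumed request shape)."""
    with open(image_path, "rb") as f:
        image_b64 = base64.b64encode(f.read()).decode("ascii")

    response = requests.post(
        "https://plant.id/api/v3/identification",  # assumed endpoint
        params={"details": "description_gpt,description_all"},  # assumed detail keys
        headers={"Api-Key": API_KEY},
        json={"images": [image_b64]},
        timeout=30,
    )
    response.raise_for_status()
    return response.json()
```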