The Plant Identification Battle: GPT-4 vs. Plant.id

Ondřej Veselý

March 21, 2024

•

7 min read

Share this post

How we did it?

We randomly selected 50 photos submitted through the Plant.id API that were later confidently identified by botanical experts.

We have submitted the photos through:

3.1.0. model from the Plant.id API,
multimodal GPT-4 Turbo model through the OpenAI API.

In GPT-4, the following prompt was used to obtain the three best suggestions:

‍Can you tell me the name of this plant? Give me your three best ideas as scientific names.
‍

Here is an example of an answer:

‍

Based on the image provided, the plant appears to be a type of variegated foliage plant commonly used as an indoor ornamental. Here are three possible scientific names for plants with a similar appearance:

1. Dracaena fragrans 'Lemon Lime' – A member of the Dracaena genus with variegated leaves that can resemble the plant in the photo.

2. Chlorophytum comosum 'Variegatum' – Also known as the variegated spider plant, which has striped leaves that could be similar to what is shown in the image.

3. Aglaonema commutatum – Recognized for its variegated leaves, which might look like the plant in the picture.

Without a more precise view or additional information about the plant's characteristics, such as flowers or growth habit, it's difficult to determine the exact species just from the image.

‍

We evaluated whether the correct identification was in the first suggestion (top1) or in one of the first three suggestions (top3). We evaluated matches at the species level (e.g. Chlorophytum comosum) and the genus level (e.g. Chlorophytum).

What have we found?

On the species level, Plant.id had almost five times less misidentifications (12%) than GPT-4 (58%) for top1 suggestions. For top3 suggestions, the difference was even greater, with Plant.id giving 4% wrong answers and GPT-4 giving 36% wrong answers.

The overall error rate was about twice as low on the genus level but the difference in performance between the models was similar: for top1 suggestions, Plant.id was wrong about 4x less often than GPT-4, with 8% and 36% of wrong answers, respectively; for top3 suggestions, it was 10x less often, with 2% and 20%, respectively.

Although GPT-4 Turbo can be an interesting solution for people who want to identify plants, the results show that narrowly focused models such as Plant.id are still superior in terms of identification accuracy.

The results, including the photos and GPT responses, can be found in this spreadsheet.

*Proportion of wrong answers (the smaller the column, the better the result).*

‍Methods details

Plant images were collected from February 16 to February 25, 2024. Genus and species names were considered, plant varieties (if present) were ignored for the evaluation. If a synonymous name was given, it was considered a match (e.g. the genera Schefflera and Plerandra). If a plant was identified only at the genus level (such as Rosa sp. or Pinus sp.; 13 cases), the response from the compared models was considered a match if the species level was provided (such as Rosa chinensis or Pinus halepensis). In three cases, GPT-4 refused to give the requested answer; these cases were considered as wrong answers.

Most of the selected plants (60%) were labeled as houseplants by an expert, reflecting the typical traffic of plant identification apps during winter in the northern hemisphere.

‍

Share this post

plant.id

Ondřej Veselý

cofounder, CEO, ideamaker, kindwise

We thought you might also like these.

View all

A sick plant with a mobile phone with a question on its display

plant-id

New plant.health feature: follow-up questions

With this new feature, users can receive automated, targeted, easy-to-answer, expert-backed questions to help refine potential issues and guide plant care treatments.

plant-id

7 min read

Using LLMs for Descriptions of Plants, Insects, and Fungi

We tested LLMs to improve multilingual descriptions of plants, insects, and fungi where Wikipedia entries were inadequate. GPT-4o outperformed smaller models such as GPT-4o mini and Mistral Pixtral 12B, especially in grammar and factual accuracy. It performed better in widely spoken languages and common species. In the end, GPT-4o was used to generate descriptions in 28 languages, providing better coverage for the Kindwise API.

plant-id

3 min read

Disease symptoms

This feature provides clearer and more actionable insights into plant health.

plant-id

5 min read

Summer 2024 Plant.id Model Update

The Plant.id model upgrade resulted in significant improvements in accuracy. It reached over 85% accuracy in the first response (TOP 1). The increase is greatest for European and North American plants, with 6.0% and 3.6%, respectively.

plant-id

5 min read

Suggestion filters: create custom species lists

Plant.id now allows plant species to be filtered by various criteria such as geographical distribution, morphology, ecology or practical use. This increases identification accuracy and confidence in the model.

plant-id

6 min read

Kindwise meets large language models

We started looking for synergies between our narrow ML models and LLMs. Let's play with this together on our demo 🙂

plant-id

1 min read

Plant.id excells in urban forest biodiversity study

Plant.id emerged as the leading application for the automatic identification of tree species, outperforming other methods in accuracy and efficiency.

plant-id

3 min read

API Search for plant/insect/mushroom details by its name

Our endpoint allows our clients to search through huge botanical, fungi or insect database by latin name or common names to get various species info

plant-id

5 min read

Elevate plant identification with varieties

As the world of houseplants and ornamental varieties continues to flourish, many people now seek more than species identification. Our new 'Varieties' model responds to this demand and offers you a unique opportunity to gain a competitive advantage by enabling the identification of almost 1,000 plant varieties.

plant-id

6 min read

October Plant.id upgrade

We have upgraded the Plant.id model, increasing the number of classes from 33,325 to 35,756. The new version, released in October 2023, shows significant performance improvements over the previous models, especially in Europe and North America.