Abstract

The potential of citizen scientists to contribute information about species occurrences and other biodiversity questions is large, owing to the ubiquitous presence of organisms and the approachable nature of the subject. Online platforms that collect observations of species from the public have existed for several years and have recently seen rapid growth, partly due to the widespread availability of mobile phones. These online platforms, and many scientific studies as well, suffer from taxonomic bias: certain species groups are overrepresented in the data (Troudet et al. 2017). One reason for this bias is that accurate identification of species, by non-experts and experts alike, is limited by the sheer number of species that exist. Even in the geographically limited area of the Netherlands and Belgium, the number of regularly observed species runs into the thousands, making it difficult or impossible for an individual to identify them all. Recent advances in image-based species identification powered by deep learning (Norouzzadeh et al. 2018) suggest large potential for a new set of digital tools that can help the public (and experts) identify species automatically. The online observation platform Observation.org has collected over 93 million occurrences in the Netherlands and Belgium over the last 15 years. About 20% of these occurrences are supported by photographs, yielding a rich database of 17 million photographs covering all major species groups (e.g., birds, mammals, plants, insects, fungi). Most of the observations with photos were validated by human experts at Observation.org, creating a unique database suitable for machine learning. Using this database, we have developed a deep learning-based species identification model covering 13,767 species, 1,530 species groups, 734 subspecies and 117 hybrids. The model is made available to the public through a web service (https://identify.biodiversityanalysis.nl) and a set of mobile apps (ObsIdentify). In this talk we will discuss our technical approach for dealing with the large number of species in a deep learning model. We will evaluate the results in terms of performance for different species groups and what this could mean for addressing part of the taxonomic bias. We will also consider the limitations of (image-based) automated species identification and identify avenues for further improvement. We will illustrate how the web service and mobile apps are applied to support citizen scientists and the observation validation workflows at Observation.org. Finally, we will examine the potential of these methods to provide large-scale automated analysis of biodiversity data.
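
The abstract does not describe the model architecture or training setup. Purely as an illustrative sketch of the general approach it outlines (a deep image classifier with one output per taxon, trained on expert-validated photographs), the PyTorch/torchvision example below fine-tunes an ImageNet-pretrained backbone on a large, fine-grained label set. The backbone choice, hyperparameters, directory layout and the class count derived from the taxa listed above are all assumptions, not details of the Observation.org system.

```python
# Illustrative sketch only: fine-tuning a pretrained CNN for a very large,
# fine-grained label set. The class count mirrors the taxa mentioned in the
# abstract; the architecture, data layout and hyperparameters are assumptions.
import torch
import torch.nn as nn
from torchvision import datasets, models, transforms

NUM_CLASSES = 13767 + 1530 + 734 + 117  # species + species groups + subspecies + hybrids

# Standard ImageNet-style preprocessing for photographs of varying quality.
train_tf = transforms.Compose([
    transforms.RandomResizedCrop(224),
    transforms.RandomHorizontalFlip(),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],
                         std=[0.229, 0.224, 0.225]),
])

# Hypothetical directory layout: one folder of validated photos per taxon.
train_ds = datasets.ImageFolder("observations/train", transform=train_tf)
train_dl = torch.utils.data.DataLoader(train_ds, batch_size=64,
                                       shuffle=True, num_workers=8)

# Start from an ImageNet-pretrained backbone and replace the classification
# head with one output per taxon; a softmax over all classes gives the
# identification probabilities at inference time.
model = models.resnet50(weights=models.ResNet50_Weights.IMAGENET1K_V2)
model.fc = nn.Linear(model.fc.in_features, NUM_CLASSES)

device = "cuda" if torch.cuda.is_available() else "cpu"
model = model.to(device)
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.SGD(model.parameters(), lr=0.01, momentum=0.9)

model.train()
for images, labels in train_dl:  # one epoch shown; real training runs many
    images, labels = images.to(device), labels.to(device)
    optimizer.zero_grad()
    loss = criterion(model(images), labels)
    loss.backward()
    optimizer.step()
```

In practice, a label set of roughly 16,000 fine-grained taxa raises issues this sketch ignores, such as class imbalance between common and rare taxa and the relationship between species groups and their member species; the talk's discussion of handling the large number of species concerns exactly these choices.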

Highlights

  • Recent advances in image-based species identification powered by deep learning (Norouzzadeh et al. 2018) suggest large potential for a new set of digital tools that can help the public identify species automatically

  • About 20% of the more than 93 million occurrences collected by Observation.org are supported by photographs, giving a rich database of 17 million photographs covering all major species groups

  • Most of the observations with photos were validated by human experts at Observation.org, creating a unique database suitable for machine learning
