On the impact of Citizen Science-derived data quality on deep learning based classification in marine images.

Daniel Langenkämper, Daniel O B Jones, Erik Simon-Lledó, Tim W Nattkemper, Brett Hosking, Cem M Deniz

doi:10.1371/journal.pone.0218086

Open Access

https://doi.org/10.1371/journal.pone.0218086

Abstract

The evaluation of large amounts of digital image data is of growing importance for biology, including for the exploration and monitoring of marine habitats. However, only a tiny percentage of the image data collected is evaluated by marine biologists who manually interpret and annotate the image contents, which can be slow and laborious. In order to overcome the bottleneck in image annotation, two strategies are increasingly proposed: “citizen science” and “machine learning”. In this study, we investigated how the combination of citizen science, to detect objects, and machine learning, to classify megafauna, could be used to automate annotation of underwater images. For this purpose, multiple large data sets of citizen science annotations with different degrees of common errors and inaccuracies observed in citizen science data were simulated by modifying “gold standard” annotations done by an experienced marine biologist. The parameters of the simulation were determined on the basis of two citizen science experiments. It allowed us to analyze the relationship between the outcome of a citizen science study and the quality of the classifications of a deep learning megafauna classifier. The results show great potential for combining citizen science with machine learning, provided that the participants are informed precisely about the annotation protocol. Inaccuracies in the position of the annotation had the most substantial influence on the classification accuracy, whereas the size of the marking and false positive detections had a smaller influence.
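The simulation idea described above (perturbing gold standard annotations with position errors, size errors, and false positive detections) can be illustrated with a minimal sketch. This is not the authors' implementation: the circular `Annotation` type, the Gaussian noise model, and the parameters `pos_sigma`, `size_sigma`, and `fp_rate` are all hypothetical choices made for illustration, whereas the study derived its simulation parameters from two citizen science experiments.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Annotation:
    """Hypothetical circular marking: centre (x, y) and radius in pixels."""
    x: float
    y: float
    radius: float
    is_false_positive: bool = False


def simulate_citizen_annotations(gold, width, height,
                                 pos_sigma=0.0, size_sigma=0.0,
                                 fp_rate=0.0, seed=0):
    """Return a noisy copy of gold standard annotations.

    Assumed noise model (illustrative only, not taken from the paper):
      - position: Gaussian jitter with standard deviation pos_sigma (pixels)
      - size: radius scaled by (1 + Gaussian noise with std size_sigma)
      - false positives: after each gold annotation, one random spurious
        marking is added with probability fp_rate
    """
    rng = random.Random(seed)
    noisy = []
    for a in gold:
        # Jitter the centre and clamp it to the image bounds.
        x = min(max(a.x + rng.gauss(0.0, pos_sigma), 0.0), width)
        y = min(max(a.y + rng.gauss(0.0, pos_sigma), 0.0), height)
        # Rescale the marking, keeping a radius of at least one pixel.
        radius = max(1.0, a.radius * (1.0 + rng.gauss(0.0, size_sigma)))
        noisy.append(Annotation(x, y, radius))
        # Occasionally add a spurious detection at a random location.
        if rng.random() < fp_rate:
            noisy.append(Annotation(rng.uniform(0.0, width),
                                    rng.uniform(0.0, height),
                                    a.radius, True))
    return noisy


gold = [Annotation(100.0, 200.0, 20.0), Annotation(1500.0, 900.0, 35.0)]
noisy = simulate_citizen_annotations(gold, width=4000, height=3000,
                                     pos_sigma=15.0, size_sigma=0.2,
                                     fp_rate=0.1, seed=42)
```

Generating several such data sets with different parameter values, and training a classifier on image patches cut out around each marking, would allow the effect of each error type on classification accuracy to be compared in isolation.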

Highlights

In recent years computer vision has made a big leap forward in tackling some of the most demanding problems such as detection of cars or people in photos, owing to the emergence of deep learning [1, 2].
We investigate the potential of such error-prone citizen science object detections in combination with powerful deep learning classifiers.
The image collection {I_n, n = 1, ..., N}, where N is the total number of images, used in this work is from a Pacific region referred to as the Area of Particular Environmental Interest 6 (APEI-6), centered on 122° 55′ W, 17° 16′ N.

Summary

Introduction

In recent years computer vision has made a big leap forward in tackling some of the most demanding problems, such as detection of cars or people in photos, owing to the emergence of deep learning [1, 2]. Deep learning methods for image classification and object detection have been proposed successfully, but they have mostly been limited to everyday image domains, i.e. images showing “everyday objects” from human civilization such as cars, furniture, or people.

[...] 03F0707C), as well as ESL, DOBJ under the framework of JPI Oceans. The funder provided support in the form of salaries for authors DL, ESL, DOBJ, BH, but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section.

Methods

Results

Conclusion

Journal: PLOS ONE
Publication Date: Jun 12, 2019
Citations: 26
License type: CC BY 4.0

Similar Papers

Minimizing Data Waste: Conservation in the Big Data Era
Allison D Binley ... Gabriel Dansereau
The Bulletin of the Ecological Society of America | VOL. 104
10 Mar 2023

Citizen Science and Open Data: a model for Invasive Alien Species in Europe
Ana Cristina Cardoso ... Kyle Copas
Research Ideas and Outcomes | VOL. 3
04 Jul 2017

Different facets of the same niche: Integrating citizen science and scientific survey data to predict biological invasion risk under multiple global change drivers.
Mirko Di Febbraro ... Gaetano Aloise
Global Change Biology | VOL. 29
07 Aug 2023

The Partnership of Citizen Science and Machine Learning: Benefits, Risks, and Future Challenges for Engagement, Data Collection, and Data Quality
Maryam Lotfian ... Jens Ingensand
Sustainability | VOL. 13
20 Jul 2021
