A pipeline for identification of bird and frog species in tropical soundscape recordings using a convolutional neural network

Jack Lebien,Ming Zhong,Marconi Campos-Cerqueira,Julian P Velev,Rahul Dodhia,Juan Lavista Ferres,T Mitchell Aide

doi:10.1016/j.ecoinf.2020.101113

Jack Lebien, Ming Zhong + Show 5 more

Open Access

https://doi.org/10.1016/j.ecoinf.2020.101113

Copy DOI

Export

Save

Cite

Abstract
Full-Text
Similar Papers

Abstract

Listen

Automated acoustic recorders can collect long-term soundscape data containing species-specific signals in remote environments. Ecologists have increasingly used them for studying diverse fauna around the globe. Deep learning methods have gained recent attention for automating the process of species identification in soundscape recordings. We present an end-to-end pipeline for training a convolutional neural network (CNN) for multi-species multi-label classification of soundscape recordings, starting from raw, unlabeled audio. Training data for species-specific signals are collected using a semi-automated procedure consisting of an efficient template-based signal detection algorithm and a graphical user interface for rapid detection validation. A CNN is then trained based on mel-spectrograms of sound to predict the set of species present in a recording. Transfer learning of a pre-trained model is employed to reduce the necessary training data and time. Furthermore, we define a loss function that allows for using true and false template-based detections to train a multi-class multi-label audio classifier. This approach leverages relevant absence (negative) information in training, and reduces the effort in creating multi-label training data by allowing weak labels. We evaluated the pipeline using a set of soundscape recordings collected across 749 sites in Puerto Rico. A CNN model was trained to identify 24 regional species of birds and frogs. The semi-automated training data collection process greatly reduced the manual effort required for training. The model was evaluated on an excluded set of 1000 randomly sampled 1-min soundscapes from 17 sites in the El Yunque National Forest. The test recordings contained an average of ~3 present target species per recording, and a maximum of 8. The test set also showed a large class imbalance with most species being present in less than 5% of recordings, and others present in >25%. The model achieved a mean-average-precision of 0.893 across the 24 species. Across all predictions, the total average-precision was 0.975.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Ecological Informatics	Publication Date: Jun 8, 2020
Citations: 92	License type: cc-by-nc-nd

R Discovery Prime

A pipeline for identification of bird and frog species in tropical soundscape recordings using a convolutional neural network

Abstract

Published Version

Talk to us

Similar Papers

More From: Ecological Informatics

Lead the way for us

Similar Papers

Deep learning-based classification and mutation prediction from histopathological images of hepatocellular carcinoma.
Haotian Liao ... Xuefeng Li
Clinical and Translational Medicine | VOL. 10
Haotian Liao, et. al.Haotian Liao ... Xuefeng Li
01 Jun 2020
Clinical and Translational Medicine | VOL. 10

Estimation and uncertainty analysis of groundwater quality parameters in a coastal aquifer under seawater intrusion: a comparative study of deep learning and classic machine learning methods.
Mehmet Taşan ... Sevda Taşan
Environmental Science and Pollution Research | VOL. 30
Mehmet Taşan, et. al.Mehmet Taşan ... Sevda Taşan
08 Aug 2022
Environmental Science and Pollution Research | VOL. 30

The Real-Time Mobile Application for Classifying of Endangered Parrot Species Using the CNN Models Based on Transfer Learning
Daegyu Choe ... Dong Keun Kim
Mobile Information Systems | VOL. 2020
Daegyu Choe, et. al.Daegyu Choe ... Dong Keun Kim
09 Mar 2020
Mobile Information Systems | VOL. 2020

An Investigation of Deep Learning Models for EEG-Based Emotion Recognition.
Yaqing Zhang ... Xin Huang
Frontiers in Neuroscience | VOL. 14
Yaqing Zhang, et. al.Yaqing Zhang ... Xin Huang
23 Dec 2020
Frontiers in Neuroscience | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

A pipeline for identification of bird and frog species in tropical soundscape recordings using a convolutional neural network

Abstract

Published Version

Talk to us

Similar Papers

More From: Ecological Informatics