Large Acoustic Datasets Research Articles

Bioacoustics, the exploration of animal vocalizations and natural soundscapes, has emerged as a valuable tool for studying species within their habitats, particularly those that are challenging to observe. This approach has broadened the horizons of biodiversity assessment and ecological research. However, monitoring wildlife with acoustic recorders produces large volumes of data that can be labor-intensive to analyze. Deep learning has recently transformed many computational disciplines by enabling the automated processing of large and complex datasets and has gained attention within the bioacoustics community. Despite the revolutionary impact of deep learning on acoustic detection and classification, attaining both high detection accuracy and low false positive rates in bioacoustics remains a significant challenge. An intriguing yet unexplored avenue for enhancing deep learning in bioacoustics involves the utilization of contextual information, such as time and location, to discern animal vocalizations within acoustic recordings. As a first case study, a multi-branch Convolutional Neural Network (CNN) was developed to classify 22 different bird songs using spectrograms as a first input and spatial metadata as a secondary input. A comparison was made to a baseline model with only spectrogram input. A geographical prior neural network was trained separately to estimate the probability of a species occurring at a given location. The output of this network was combined with the baseline CNN. As a second case study, temporal data and spectrograms were used as input to a multi-branch CNN for the detection of calls of the Hainan gibbon (Nomascus hainanus), the world’s rarest primate. Our findings demonstrate that adding metadata to the bird song classifier significantly improves classification performance, with the highest improvement achieved using the geographical prior model (F1-score of 87.78% compared to 61.02% for the baseline model). The multi-branch CNNs also proved efficient (F1-scores of 76.87% and 78.77%) and simpler to use than the geographical prior. In the second case study, using the metadata in the multi-branch CNN reduced false positives by 63% (with 94% of the calls detected) and increased gibbon detection by 19%. This study has uncovered an exciting new avenue for improving classifier performance in bioacoustics. The methodology described in this study can assist ecologists, wildlife management teams, and researchers in reducing the amount of time spent analyzing large acoustic datasets obtained from passive acoustic monitoring studies. Our approach can be adapted and applied to other calling species, and thus tailored to other use cases.
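The abstract says the geographical prior's output was combined with the baseline CNN but does not say how. The sketch below shows one plausible scheme, assumed here for illustration only: multiply the classifier's per-species probabilities by the location prior and renormalize. The function name `combine_with_prior` and the three-species numbers are hypothetical and do not come from the study.

```python
def combine_with_prior(cnn_probs, prior_probs, eps=1e-12):
    """Reweight classifier probabilities by a location prior and renormalize.

    Assumption: elementwise product followed by normalization. The source
    abstract does not specify the combination rule actually used.
    """
    if len(cnn_probs) != len(prior_probs):
        raise ValueError("cnn_probs and prior_probs must have the same length")
    weighted = [p * q for p, q in zip(cnn_probs, prior_probs)]
    total = sum(weighted)
    if total < eps:
        # The prior rules out every species; fall back to the classifier alone.
        return list(cnn_probs)
    return [w / total for w in weighted]


# Hypothetical example with three species.
cnn_probs = [0.5, 0.4, 0.1]    # spectrogram-only classifier output
prior_probs = [0.1, 0.8, 0.6]  # estimated occurrence probability at the site

posterior = combine_with_prior(cnn_probs, prior_probs)
cnn_choice = max(range(len(cnn_probs)), key=cnn_probs.__getitem__)
combined_choice = max(range(len(posterior)), key=posterior.__getitem__)
print(cnn_choice, combined_choice)
```

In this toy case the spectrogram-only classifier prefers the first species, while the combined score prefers the second because the first is unlikely at that location. This is the kind of correction a location prior is meant to provide.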


Passive acoustic monitoring is a powerful tool for monitoring vocally active taxa. Automated signal recognition software reduces the expert time needed for recording analyses and allows researchers and managers to manage large acoustic datasets. The application of state-of-the-art techniques for automated identification, such as Convolutional Neural Networks, may be challenging for ecologists and managers without informatics or engineering expertise. Here, we evaluated the use of AudioMoth, a low-cost and open-source sound recorder, to monitor a threatened and patchily distributed species, the Eurasian bittern (Botaurus stellaris). Passive acoustic monitoring was carried out across 17 potential wetlands in northern Spain. We also assessed the performance of BirdNET (an automated and freely available classifier able to identify over 3000 bird species) and Kaleidoscope Pro (a user-friendly recognition software) in detecting the vocalizations and the presence of the target species. The percentage of presences and vocalizations of the Eurasian bittern automatically detected by BirdNET and Kaleidoscope Pro was compared to manual annotations of 205 recordings. The species was effectively recorded up to distances of 801–900 m, with at least 50% of the vocalizations uttered within that distance being manually detected; this distance was reduced to 601–700 m when considering the analyses carried out using Kaleidoscope Pro. BirdNET detected the species in 59 of the 63 (93.7%) recordings with known presence of the species, while Kaleidoscope Pro detected the bittern in 62 recordings (98.4%). At the vocalization level, BirdNET and Kaleidoscope Pro detected 76% and 78%, respectively, of the vocalizations detected by a human observer. Our study highlights the ability of AudioMoth to detect the bittern at large distances, which increases the potential of that technique for monitoring the species at large spatial scales. According to our results, a single AudioMoth could be useful for monitoring the species' presence in wetlands of up to 150 ha. Our study proves the utility of passive acoustic monitoring, coupled with BirdNET or Kaleidoscope Pro, as an accurate, repeatable, and cost-efficient method for monitoring the Eurasian bittern at large spatial and temporal scales. Nonetheless, further research should evaluate the performance of BirdNET on a larger number of species, and under different recording conditions (e.g., more closed habitats), to improve our knowledge about BirdNET's ability to perform bird monitoring. Future studies should also aim to develop an adequate protocol to perform effective passive acoustic monitoring of the Eurasian bittern.
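The recording-level percentages quoted above can be reproduced with simple arithmetic. The short sketch below recomputes them from the counts given in the abstract (59 and 62 detections out of 63 recordings with known presence). The helper name `detection_rate` is hypothetical.

```python
def detection_rate(detected, total):
    """Percentage of recordings with known presence in which the species was detected."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * detected / total


# Counts taken from the abstract: 63 recordings with known bittern presence.
birdnet_rate = detection_rate(59, 63)
kaleidoscope_rate = detection_rate(62, 63)

print(f"BirdNET: {birdnet_rate:.1f}%")                # 93.7%
print(f"Kaleidoscope Pro: {kaleidoscope_rate:.1f}%")  # 98.4%
```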



Articles published on Large Acoustic Datasets

A post‐processing framework for assessing BirdNET identification accuracy and community composition

Automated detection of Bornean white-bearded gibbon (Hylobates albibarbis) vocalizations using an open-source framework for deep learning.

Potential of K-means clustering for preliminary labeling of acoustic data samples

Characteristics and spatiotemporal variation of sei whale (Balaenoptera borealis) downsweeps recorded in Atlantic Canada.

Improving deep learning acoustic classifiers with contextual information for wildlife monitoring

Artificial intelligence (BirdNET) supplements manual methods to maximize bird species richness from acoustic data sets generated from regional monitoring

Adapting deep learning models to new acoustic environments - A case study on the North Atlantic right whale upcall

Manual Versus Semiautomated Bioacoustic Analysis Methods of Multiple Vocalizations in Tricolored Blackbird Colonies

Passive acoustic exploration of the ocean

Silbido profundo: An open source package for the use of deep learning to detect odontocete whistles.

A Robust Method to Automatically Detect Fin Whale Acoustic Presence in Large and Diverse Passive Acoustic Datasets

Low-cost open-source recorders and ready-to-use machine learning approaches provide effective monitoring of threatened species

A new method employing species‐specific thresholding identifies acoustically overlapping bats

Automatic detection and classification of baleen and toothed whale calls via machine learning approaches over instantaneous wide areas in the Gulf of Maine received on a coherent hydrophone array

Detecting and reducing heterogeneity of error in acoustic classification

Echolocation click discrimination for three killer whale ecotypes in the Northeastern Pacific.

Development of deep neural networks for marine mammal call detection using an open-source, user friendly tool

PAMGuard: Open-source detection, classification, and Localization software

Caller ID for Risso’s and Pacific White-sided dolphins

Using a Novel Visualization Tool for Rapid Survey of Long-Duration Acoustic Recordings for Ecological Studies of Frog Chorusing
