Acoustic Classification of Bird Species Using an Early Fusion of Deep Features

Jie Xie,Mingying Zhu

doi:10.3390/birds4010011

Abstract

Bird sound classification plays an important role in large-scale temporal and spatial environmental monitoring. In this paper, we investigate both transfer learning and training from scratch for bird sound classification, where pre-trained models are used as feature extractors. Specifically, deep cascade features are extracted from various layers of different pre-trained models, which are then fused to classify bird sounds. A multi-view spectrogram is constructed to characterize bird sounds by simply repeating the spectrogram to make it suitable for pre-trained models. Furthermore, both mixup and pitch shift are applied for augmenting bird sounds to improve the classification performance. Experimental classification on 43 bird species using linear SVM indicates that deep cascade features can achieve the highest balanced accuracy of 90.94% ± 1.53%. To further improve the classification performance, an early fusion method is used by combining deep cascaded features extracted from different pre-trained models. The final best classification balanced accuracy is 94.89% ± 1.35%.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Birds	Publication Date: Mar 1, 2023
Citations: 5	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Acoustic Classification of Bird Species Using an Early Fusion of Deep Features

Abstract

Talk to us

Similar Papers

More From: Birds

Lead the way for us

Similar Papers

Comparison of early and late fusion techniques for movie trailer genre labelling
J.H Mervitz ... J.P De Villiers
-
J.H Mervitz, et. al.J.H Mervitz ... J.P De Villiers
01 Jul 2020
01 Jul 2020

One-step progressive representation transfer learning for bird sound classification
Chengyun Zhang ... Xinghui Gao
Applied Acoustics | VOL. 212
Chengyun Zhang, et. al.Chengyun Zhang ... Xinghui Gao
01 Sep 2023
Applied Acoustics | VOL. 212

Deep Feature Fusion and Optimization-Based Approach for Stomach Disease Classification.
Farah Mohammad ... Muna Al-Razgan
Sensors (Basel, Switzerland) | VOL. 22
Farah Mohammad, et. al.Farah Mohammad ... Muna Al-Razgan
06 Apr 2022
Sensors (Basel, Switzerland) | VOL. 22

Unsound wheat kernel recognition based on deep convolutional neural network transfer learning and feature fusion
Qinghui Zhang ... Yong Wu
Journal of Intelligent & Fuzzy Systems | VOL. 43
Qinghui Zhang, et. al.Qinghui Zhang ... Yong Wu
22 Sep 2022
Journal of Intelligent & Fuzzy Systems | VOL. 43

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Acoustic Classification of Bird Species Using an Early Fusion of Deep Features

Abstract

Talk to us

Similar Papers

More From: Birds