Automatic Classification of Bird Sounds: Using MFCC and Mel Spectrogram Features with Deep Learning

Silvestre Carvalho, Elsa Ferreira Gomes

doi:10.1142/s2196888822500300

Open Access

https://doi.org/10.1142/s2196888822500300

Abstract

Bird species identification is an important but time-consuming task for ornithologists and ecologists. With growing amounts of annotated audio data, automatic bird classification using machine learning techniques has become an important trend in the scientific community. Analyzing bird behavior and population trends helps detect other organisms in the environment and is an important problem in ecology. Bird populations react quickly to environmental changes, which makes counting and tracking them in real time both challenging and very useful. A reliable methodology that automatically identifies bird species from audio would therefore be a valuable tool for experts in different scientific and applied domains. The goal of this work is to propose a methodology to identify bird species from their sounds. In this paper, we explore deep learning techniques that are being used in this domain, such as Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs), to classify the data. In deep learning, audio problems are commonly approached by converting the audio into images using feature extraction techniques such as Mel Spectrograms and Mel Frequency Cepstral Coefficients (MFCCs). We propose and test multiple combinations of deep learning models and feature extraction techniques in order to find the most suitable approach to this problem.
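To make the two feature types named in the abstract concrete, the following is a minimal sketch of how a log-Mel spectrogram and MFCCs can be computed from a waveform. It is not the authors' pipeline: the abstract gives no parameters, so the frame size, hop length, number of Mel bands, number of coefficients, the HTK-style Mel formula, and all function names here are assumptions chosen for illustration. It uses only the Python standard library and a synthetic tone in place of a bird recording.

```python
import cmath
import math


def hz_to_mel(hz):
    # HTK-style mel formula; one common convention, assumed here.
    return 2595.0 * math.log10(1.0 + hz / 700.0)


def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def power_spectrum(frame):
    # Naive DFT of one Hann-windowed frame, bins 0..N/2 (slow but dependency-free).
    n = len(frame)
    windowed = [x * (0.5 - 0.5 * math.cos(2 * math.pi * i / n))
                for i, x in enumerate(frame)]
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(windowed[t] * cmath.exp(-2j * math.pi * k * t / n)
                  for t in range(n))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def mel_filterbank(n_mels, n_fft, sample_rate):
    # Triangular filters whose edges are evenly spaced on the mel scale.
    max_mel = hz_to_mel(sample_rate / 2.0)
    edges = [mel_to_hz(max_mel * i / (n_mels + 1)) for i in range(n_mels + 2)]
    bin_hz = [k * sample_rate / n_fft for k in range(n_fft // 2 + 1)]
    bank = []
    for m in range(1, n_mels + 1):
        lo, mid, hi = edges[m - 1], edges[m], edges[m + 1]
        bank.append([max(0.0, min((f - lo) / (mid - lo), (hi - f) / (hi - mid)))
                     for f in bin_hz])
    return bank


def mfcc_from_log_mel(log_mel, n_mfcc):
    # Unnormalized DCT-II of the log-mel energies; keep the first n_mfcc terms.
    n = len(log_mel)
    return [sum(v * math.cos(math.pi * k * (2 * i + 1) / (2 * n))
                for i, v in enumerate(log_mel))
            for k in range(n_mfcc)]


def extract_features(signal, sample_rate, n_fft=256, hop=128, n_mels=20, n_mfcc=13):
    # Returns (log-mel spectrogram, MFCCs), each as a list of per-frame vectors.
    bank = mel_filterbank(n_mels, n_fft, sample_rate)
    mel_spec, mfccs = [], []
    for start in range(0, len(signal) - n_fft + 1, hop):
        spec = power_spectrum(signal[start:start + n_fft])
        log_mel = [math.log(sum(w * p for w, p in zip(row, spec)) + 1e-10)
                   for row in bank]
        mel_spec.append(log_mel)
        mfccs.append(mfcc_from_log_mel(log_mel, n_mfcc))
    return mel_spec, mfccs


# Synthetic 2 kHz tone standing in for a recording (illustrative only).
sample_rate = 8000
tone = [math.sin(2 * math.pi * 2000 * t / sample_rate) for t in range(1024)]
mel_spec, mfccs = extract_features(tone, sample_rate)
```

Each result is a frames-by-features grid, which is what allows it to be treated as an image for a CNN or as a sequence of vectors for an RNN. In practice an audio library would normally do this work; for example, `librosa.feature.melspectrogram` and `librosa.feature.mfcc` compute the same kinds of features, with different defaults.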

Journal: Vietnam Journal of Computer Science	Publication Date: Aug 10, 2022
Citations: 6	License type: cc-by
