Intelligent Audio Signal Processing for Detecting Rainforest Species Using Deep Learning

Rakesh Kumar,Shakeel Ahmed,Tushar Aggarwal,Abdulaziz Alhumam,Meenu Gupta

doi:10.32604/iasc.2022.019811

Abstract

Hearing a species in a tropical rainforest is much easier than seeing them. If someone is in the forest, he might not be able to look around and see every type of bird and frog that are there but they can be heard. A forest ranger might know what to do in these situations and he/she might be an expert in recognizing the different type of insects and dangerous species that are out there in the forest but if a common person travels to a rain forest for an adventure, he might not even know how to recognize these species, let alone taking suitable action against them. In this work, a model is built that can take audio signal as input, perform intelligent signal processing for extracting features and patterns, and output which type of species is present in the audio signal. The model works end to end and can work on raw input and a pipeline is also created to perform all the preprocessing steps on the raw input. In this work, different types of neural network architectures based on Long Short Term Memory (LSTM) and Convolution Neural Network (CNN) are tested. Both are showing reliable performance, CNN shows an accuracy of 95.62% and Log Loss of 0.21 while LSTM shows an accuracy of 93.12% and Log Loss of 0.17. Based on these results, it is shown that CNN performs better than LSTM in terms of accuracy while LSTM performs better than CNN in terms of Log Loss. Further, both of these models are combined to achieve high accuracy and low Log Loss. A combination of both LSTM and CNN shows an accuracy of 97.12% and a Log Loss of 0.16.

Highlights

A long time ago, Rainforest formed after tropical temperatures dropped down drastically when the Atlantic Ocean had widened enough to provide a warm, moist climate to the Amazon basin [1]
A total of 4727 audio signals were recorded and later this data was manually checked by experts and for ~1100 of these audio files, they found that their device detected correct species and information for these ~1100 files were stored in a separate file which contains the name of the audio file, species which is present in the audio, maximum and minimum frequency of the audio and the time at which a species was heard in that audio file
3.2.1 Long Short Term Memory This section uses LSTM analysis for the considered dataset. This model is created in such a way that it captures sequential info of frequency of the sound of a species (LSTM layer with 50 units) along and calculates average frequency over the sample (Global Average Pooling)

Summary

Introduction

A long time ago (i.e., more than 500 lakh years ago), Rainforest formed after tropical temperatures dropped down drastically when the Atlantic Ocean had widened enough to provide a warm, moist climate to the Amazon basin [1]. Rainforest includes a large variety of species which accounts for 50% of. People have tried to become experts in all kinds of species by going through the training program for a forest ranger. There are many species which are only found in tropical rainforest and few people know about them and so, even forest rangers are being deprived of this type of training. It takes years of real-world experience to become an expert but machine learning can help in automating this process and not let us worry about knowing all bits and pieces of every species. A model can be built that can act as a companion and recognize every species that is around us while roaming in a forest

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Intelligent Automation & Soft Computing	Publication Date: Jan 1, 2022
Citations: 5	License type: cc-by

R Discovery Prime

R Discovery Prime

Intelligent Audio Signal Processing for Detecting Rainforest Species Using Deep Learning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Intelligent Automation & Soft Computing

Lead the way for us

Similar Papers

Gujarati Task Oriented Dialogue Slot Tagging Using Deep Neural Network Models
Rachana Parikh ... Hiren Joshi
-
Rachana Parikh, et. al.Rachana Parikh ... Hiren Joshi
01 Jan 2020
01 Jan 2020

Predicting of air pollutant concentrations based on spatio-temporal attention convolutional LSTM networks
A Hmelnov ... P Jiang
-
A Hmelnov, et. al.A Hmelnov ... P Jiang
05 May 2021
05 May 2021

Deep-Learning-Based Sequence Causal Long-Term Recurrent Convolutional Network for Data Fusion Using Video Data
Daehyeon Jeon ... Min-Suk Kim
Electronics | VOL. 12
Daehyeon Jeon, et. al.Daehyeon Jeon ... Min-Suk Kim
24 Feb 2023
Electronics | VOL. 12

Vulnerability detection with deep learning
Fang Wu ... Jiqiang Liu
-
Fang Wu, et. al.Fang Wu ... Jiqiang Liu
01 Dec 2017
01 Dec 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Intelligent Audio Signal Processing for Detecting Rainforest Species Using Deep Learning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Intelligent Automation &amp; Soft Computing

More From: Intelligent Automation & Soft Computing