Abstract

The complexity of environmental sounds imposes numerous challenges on their classification. The performance of Environmental Sound Classification (ESC) depends greatly on how well the employed feature extraction technique captures generic and prototypical features of a sound. Silent and semantically irrelevant frames are ubiquitous in environmental sound recordings. To address these issues, we introduce a novel attention-based deep model that focuses on semantically relevant frames. The proposed attention-guided deep model efficiently learns the spatio-temporal relationships present in the spectrogram of a signal. The efficacy of the proposed method is evaluated on two widely used Environmental Sound Classification datasets: ESC-10 and DCASE 2019 Task-1(A). The experiments and their results demonstrate that the proposed method yields performance comparable to state-of-the-art techniques. We obtained improvements of 11.50% and 19.50% in accuracy over the baseline models of the ESC-10 and DCASE 2019 Task-1(A) datasets, respectively. To support the claim that the attention mechanism focuses on relevant regions, a visual analysis of the attention feature map is also presented. The resultant attention feature map shows that the model attends only to the semantically relevant regions of the spectrogram while skipping the irrelevant ones.
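
To illustrate the general idea of frame-level attention over spectrogram features, the following is a minimal sketch, not the authors' exact architecture: it assumes PyTorch, a hypothetical `FrameAttentionPool` module, and made-up dimensions, with features that would normally come from a CNN spectrogram encoder.

```python
# Minimal sketch of attention pooling over spectrogram time frames (assumption:
# this only illustrates the general mechanism, not the paper's exact model).
import torch
import torch.nn as nn


class FrameAttentionPool(nn.Module):
    """Weights each time frame by a learned relevance score, so that
    silent or semantically irrelevant frames contribute less."""

    def __init__(self, feat_dim: int, n_classes: int):
        super().__init__()
        self.score = nn.Linear(feat_dim, 1)          # per-frame relevance score
        self.classifier = nn.Linear(feat_dim, n_classes)

    def forward(self, feats: torch.Tensor):
        # feats: (batch, time_frames, feat_dim), e.g. from a spectrogram encoder
        attn = torch.softmax(self.score(feats), dim=1)   # (batch, T, 1) attention weights
        pooled = (attn * feats).sum(dim=1)               # attention-weighted clip summary
        return self.classifier(pooled), attn.squeeze(-1)


# Example with hypothetical sizes: 8 clips, 128 time frames, 64-dim features,
# and the 10 classes of ESC-10.
model = FrameAttentionPool(feat_dim=64, n_classes=10)
logits, weights = model(torch.randn(8, 128, 64))
print(logits.shape, weights.shape)  # torch.Size([8, 10]) torch.Size([8, 128])
```

The returned per-frame weights can be plotted over the spectrogram to inspect which regions the model attends to, analogous to the attention feature maps visualized in the paper.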
