Environmental sound classification using a regularized deep convolutional neural network with data augmentation

Zohaib Mushtaq,Shun-Feng Su

doi:10.1016/j.apacoust.2020.107389

Abstract

The adoption of the environmental sound classification (ESC) tasks increases very rapidly over recent years due to its broad range of applications in our daily routine life. ESC is also known as Sound Event Recognition (SER) which involves the context of recognizing the audio stream, related to various environmental sounds. Some frequent and common aspects like non-uniform distance between acoustic source and microphone, the difference in the framework, presence of numerous sounds sources in audio recordings and overlapping various sound events make this ESC problem much complex and complicated. This study is to employ deep convolutional neural networks (DCNN) with regularization and data enhancement with basic audio features that have verified to be efficient on ESC tasks. In this study, the performance of DCNN with max-pooling (Model-1) and without max-pooling (Model-2) function are examined. Three audio attribute extraction techniques, Mel spectrogram (Mel), Mel Frequency Cepstral Coefficient (MFCC) and Log-Mel, are considered for the ESC-10, ESC-50, and Urban sound (US8K) datasets. Furthermore, to avoid the risk of overfitting due to limited numbers of data, this study also introduces offline data augmentation techniques to enhance the used datasets with a combination of L2 regularization. The performance evaluation illustrates that the best accuracy attained by the proposed DCNN without max-pooling function (Model-2) and using Log-Mel audio feature extraction on those augmented datasets. For ESC-10, ESC-50 and US8K, the highest achieved accuracies are 94.94%, 89.28%, and 95.37% respectively. The experimental results show that the proposed approach can accomplish the best performance on environment sound classification problems.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Environmental sound classification using a regularized deep convolutional neural network with data augmentation

Abstract

Talk to us

Similar Papers

More From: Applied Acoustics

Lead the way for us

Journal: Applied Acoustics	Publication Date: May 5, 2020
Citations: 111

Similar Papers

Deep Learning-based Environmental Sound Classification Using Feature Fusion and Data Enhancement
Rashid Jahangir ... Sabah M Alzahrani
Computers, Materials & Continua | VOL. 74
Rashid Jahangir, et. al.Rashid Jahangir ... Sabah M Alzahrani
01 Jan 2023
Computers, Materials & Continua | VOL. 74

Emergency Detection with Environment Sound Using Deep Convolutional Neural Networks
Jivitesh Sharma ... Morten Goodwin
-
Jivitesh Sharma, et. al.Jivitesh Sharma ... Morten Goodwin
01 Oct 2020
01 Oct 2020

Generative Model Driven Representation Learning in a Hybrid Framework for Environmental Audio Scene and Sound Event Recognition
S Chandrakala ... S L Jayalakshmi
IEEE Transactions on Multimedia | VOL. 22
S Chandrakala, et. al.S Chandrakala ... S L Jayalakshmi
23 Jul 2019
IEEE Transactions on Multimedia | VOL. 22

Environment Sound Classification Using a Two-Stream CNN Based on Decision-Level Fusion.
Yu Su ... Ke Zhang
Sensors | VOL. 19
Yu Su, et. al.Yu Su ... Ke Zhang
11 Apr 2019
Sensors | VOL. 19

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Environmental sound classification using a regularized deep convolutional neural network with data augmentation

Abstract

Talk to us

Similar Papers

More From: Applied Acoustics