Abstract

Classification of environmental sounds plays a key role in security, investigation, robotics since the study of the sounds present in a specific environment can allow to get significant insights. Lack of standardized methods for an automatic and effective environmental sound classification (ESC) creates a need to be urgently satisfied. As a response to this limitation, in this paper, a hybrid model for automatic and accurate classification of environmental sounds is proposed. Optimum allocation sampling (OAS) is used to elicit the informative samples from each class. The representative samples obtained by OAS are turned into the spectrogram containing their time-frequency-amplitude representation by using a short-time Fourier transform (STFT). The spectrogram is then given as an input to pre-trained AlexNet and Visual Geometry Group (VGG)-16 networks. Multiple deep features are extracted using the pre-trained networks and classified by using multiple classification techniques namely decision tree (fine, medium, coarse kernel), k-nearest neighbor (fine, medium, cosine, cubic, coarse and weighted kernel), support vector machine, linear discriminant analysis, bagged tree and softmax classifiers. The ESC-10, a ten-class environmental sound dataset, is used for the evaluation of the methodology. An accuracy of 90.1%, 95.8%, 94.7%, 87.9%, 95.6%, and 92.4% is obtained with a decision tree, k-neared neighbor, support vector machine, linear discriminant analysis, bagged tree and softmax classifier respectively. The proposed method proved to be robust, effective, and promising in comparison with other existing state-of-the-art techniques, using the same dataset.

Highlights

  • Environment sound is due to numerous sources present in the environment, such as living beings, non-living objects and artificial entities created by humans

  • The method proposed by Ahmad et al achieved an accuracy of 77.7% and 87.25% with extreme learning machine (ELM) and LS-support vector machine (SVM), respectively

  • An adaptive, robust, and effective methodology is needed for correct classification of environmental signals

Read more

Summary

Introduction

Environment sound is due to numerous sources present in the environment, such as living beings, non-living objects and artificial entities created by humans. Numerous studies sand research contributions in the area of environment sound classification (ESC) One such important work on ESC helps detecting natural sounds of environment successfully [3], by using optimum allocation sampling and employing various support vector machine (SVM) and extreme machine learning based classifiers, which operate on OAS-EMD features. A four convolution layer based deep learning network has been used in [6], that utilized Dempster-Shafer evidence theory to produce classification.

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call