Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification

Justin Salamon,Juan Pablo Bello

doi:10.1109/lsp.2017.2657381

Justin Salamon, Juan Pablo Bello

Open Access

https://doi.org/10.1109/lsp.2017.2657381

Copy DOI

Journal: IEEE Signal Processing Letters	Publication Date: Mar 1, 2017
Citations: 1234	License type: publisher-specific, author manuscript

Affiliation: New York University

Abstract

The ability of deep convolutional neural networks (CNNs) to learn discriminative spectro-temporal patterns makes them well suited to environmental sound classification. However, the relative scarcity of labeled data has impeded the exploitation of this family of high-capacity models. This study has two primary contributions: first, we propose a deep CNN architecture for environmental sound classification. Second, we propose the use of audio data augmentation for overcoming the problem of data scarcity and explore the influence of different augmentations on the performance of the proposed CNN architecture. Combined with data augmentation, the proposed model produces state-of-the-art results for environmental sound classification. We show that the improved performance stems from the combination of a deep, high-capacity model and an augmented training set: this combination outperforms both the proposed CNN without augmentation and a “shallow” dictionary learning model with augmentation. Finally, we examine the influence of each augmentation on the model's classification accuracy for each class, and observe that the accuracy for each class is influenced differently by each augmentation, suggesting that the performance of the model could be improved further by applying class-conditional data augmentation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification

Abstract

Talk to us

Similar Papers

More From: IEEE Signal Processing Letters

Lead the way for us

Similar Papers

A Method of Environmental Sound Classification Based on Residual Networks and Data Augmentation
Jinfang Zeng ... Youming Li
International Journal of Computational Intelligence and Applications | VOL. 20
Jinfang Zeng, et. al.Jinfang Zeng ... Youming Li
13 Aug 2021
International Journal of Computational Intelligence and Applications | VOL. 20

Environmental Sound Classification Using Deep Convolutional Neural Networks and Data Augmentation
Nithya Davis ... K Suresh
-
Nithya Davis, et. al.Nithya Davis ... K Suresh
01 Dec 2018
01 Dec 2018

Clinically Relevant Vulnerabilities of Deep Machine Learning Systems for Skin Cancer Diagnosis
Xinyi Du-Harpur ... Magnus D Lynch
Journal of Investigative Dermatology | VOL. 141
Xinyi Du-Harpur, et. al.Xinyi Du-Harpur ... Magnus D Lynch
12 Sep 2020
Journal of Investigative Dermatology | VOL. 141

CNN-RNN and Data Augmentation Using Deep Convolutional Generative Adversarial Network for Environmental Sound Classification
Behnaz Bahmei ... Siamak Arzanpour
IEEE Signal Processing Letters | VOL. 29
Behnaz Bahmei, et. al.Behnaz Bahmei ... Siamak Arzanpour
01 Jan 2021
IEEE Signal Processing Letters | VOL. 29

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification

Abstract

Talk to us

Similar Papers

More From: IEEE Signal Processing Letters