Audio classification using braided convolutional neural networks

Harsh Sinha,Pawan K Ajmera,Vinayak Awasthi

doi:10.1049/iet-spr.2019.0381

Abstract

Convolutional neural networks (CNNs) work surprisingly well and have helped drastically enhance the state-of-the-art techniques in the domain of image classification. The unprecedented success motivated the application of CNNs to the domain of auditory data. Recent publications suggest hidden Markov models and deep neural networks for audio classification. This study aims to achieve audio classification by representing audio as spectrogram images and then use a CNN-based architecture for classification. This study presents an innovative strategy for a CNN-based neural architecture that learns a sparse representation imitating the receptive neurons in the primary auditory cortex in mammals. The feasibility of the proposed CNN-based neural architecture is assessed for audio classification tasks on standard benchmark datasets such as Google Speech Commands datasets (GSCv1 and GSCv2) and the UrbanSound8K dataset (US8K). The proposed CNN architecture, referred to as braided convolutional neural network, achieves 97.15, 95 and 91.9% average recognition accuracy on GSCv1, GSCv2 and US8 K datasets, respectively, outperforming other deep learning architectures.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Audio classification using braided convolutional neural networks

Abstract

Talk to us

Similar Papers

More From: IET Signal Processing

Lead the way for us

Journal: IET Signal Processing	Publication Date: Sep 1, 2020
Citations: 22

Similar Papers

Characteristics of neurons in auditory cortex of monkeys performing a simple auditory task.
B E Pfingst ... T A O'Connor
Journal of neurophysiology | VOL. 45
B E Pfingst, et. al.B E Pfingst ... T A O'Connor
01 Jan 1981
Journal of neurophysiology | VOL. 45

Auditory Brain Development in Children With Hearing Loss – Part One
Jace Wolfe ... Joanna Smith
The Hearing Journal | VOL. 69
Jace Wolfe, et. al.Jace Wolfe ... Joanna Smith
01 Oct 2016
The Hearing Journal | VOL. 69

Reliability and Representational Bandwidth in the Auditory Cortex
Michael R Deweese ... Anthony M Zador
Neuron | VOL. 48
Michael R Deweese, et. al.Michael R Deweese ... Anthony M Zador
01 Nov 2005
Neuron | VOL. 48

Auditory Plasticity: Vocal Output Shapes Auditory Cortex
Andrew J King
Current Biology | VOL. 15
Andrew J KingAndrew J King
01 Jul 2005
Current Biology | VOL. 15

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Audio classification using braided convolutional neural networks

Abstract

Talk to us

Similar Papers

More From: IET Signal Processing