Classification of audio signals using AANN and GMM

P Dhanalakshmi,S Palanivel,V Ramalingam

doi:10.1016/j.asoc.2009.12.033

Abstract

Today, digital audio applications are part of our everyday lives. Audio classification can provide powerful tools for content management. If an audio clip automatically can be classified it can be stored in an organised database, which can improve the management of audio dramatically. In this paper, we propose effective algorithms to automatically classify audio clips into one of six classes: music, news, sports, advertisement, cartoon and movie. For these categories a number of acoustic features that include linear predictive coefficients, linear predictive cepstral coefficients and mel-frequency cepstral coefficients are extracted to characterize the audio content. The autoassociative neural network model (AANN) is used to capture the distribution of the acoustic feature vectors. The AANN model captures the distribution of the acoustic features of a class, and the backpropagation learning algorithm is used to adjust the weights of the network to minimize the mean square error for each feature vector. The proposed method also compares the performance of AANN with a Gaussian mixture model (GMM) wherein the feature vectors from each class were used to train the GMM models for those classes. During testing, the likelihood of a test sample belonging to each model is computed and the sample is assigned to the class whose model produces the highest likelihood.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Classification of audio signals using AANN and GMM

Abstract

Talk to us

Similar Papers

More From: Applied Soft Computing

Lead the way for us

Journal: Applied Soft Computing	Publication Date: Jan 6, 2010
Citations: 73

Similar Papers

Pattern classification models for classifying and indexing audio signals
P Dhanalakshmi ... V Ramalingam
Engineering Applications of Artificial Intelligence | VOL. 24
P Dhanalakshmi, et. al.P Dhanalakshmi ... V Ramalingam
13 Nov 2010
Engineering Applications of Artificial Intelligence | VOL. 24

Classification of audio signals using SVM and RBFNN
P Dhanalakshmi ... V Ramalingam
Expert Systems with Applications | VOL. 36
P Dhanalakshmi, et. al.P Dhanalakshmi ... V Ramalingam
03 Jul 2008
Expert Systems with Applications | VOL. 36

Real-time prediction of upcoming respiratory events via machine learning using snoring sound signal.
Bochun Wang ... Ji Wu
Journal of clinical sleep medicine : JCSM : official publication of the American Academy of Sleep Medicine | VOL. 17
Bochun Wang, et. al.Bochun Wang ... Ji Wu
12 Apr 2021
Journal of clinical sleep medicine : JCSM : official publication of the American Academy of Sleep Medicine | VOL. 17

Robust Speaker Identification System Based on Wavelet Transform and Gaussian Mixture Model
Wan-Chen Chen ... Ching-Tang Hsieh
-
Wan-Chen Chen, et. al.Wan-Chen Chen ... Ching-Tang Hsieh
01 Jan 2004
01 Jan 2004

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Classification of audio signals using AANN and GMM

Abstract

Talk to us

Similar Papers

More From: Applied Soft Computing