Pattern classification models for classifying and indexing audio signals

P Dhanalakshmi,S Palanivel,V Ramalingam

doi:10.1016/j.engappai.2010.10.011

Abstract

In the age of digital information, audio data has become an important part in many modern computer applications. Audio classification and indexing has been becoming a focus in the research of audio processing and pattern recognition. In this paper, we propose effective algorithms to automatically classify audio clips into one of six classes: music, news, sports, advertisement, cartoon and movie. For these categories a number of acoustic features that include linear predictive coefficients, linear predictive cepstral coefficients and mel-frequency cepstral coefficients are extracted to characterize the audio content. The autoassociative neural network model (AANN) is used to capture the distribution of the acoustic feature vectors. Then the proposed method uses a Gaussian mixture model (GMM)-based classifier where the feature vectors from each class were used to train the GMM models for those classes. During testing, the likelihood of a test sample belonging to each model is computed and the sample is assigned to the class whose model produces the highest likelihood. Audio clip extraction, feature extraction, creation of index, and retrieval of the query clip are the major issues in automatic audio indexing and retrieval. A method for indexing the classified audio using LPCC features and k-means clustering algorithm is proposed.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Pattern classification models for classifying and indexing audio signals

Abstract

Talk to us

Similar Papers

More From: Engineering Applications of Artificial Intelligence

Lead the way for us

Journal: Engineering Applications of Artificial Intelligence	Publication Date: Nov 13, 2010
Citations: 16

Similar Papers

Classification of audio signals using AANN and GMM
P Dhanalakshmi ... V Ramalingam
Applied Soft Computing | VOL. 11
P Dhanalakshmi, et. al.P Dhanalakshmi ... V Ramalingam
06 Jan 2010
Applied Soft Computing | VOL. 11

Classification of audio signals using SVM and RBFNN
P Dhanalakshmi ... V Ramalingam
Expert Systems with Applications | VOL. 36
P Dhanalakshmi, et. al.P Dhanalakshmi ... V Ramalingam
03 Jul 2008
Expert Systems with Applications | VOL. 36

Real-time prediction of upcoming respiratory events via machine learning using snoring sound signal.
Bochun Wang ... Ji Wu
Journal of clinical sleep medicine : JCSM : official publication of the American Academy of Sleep Medicine | VOL. 17
Bochun Wang, et. al.Bochun Wang ... Ji Wu
12 Apr 2021
Journal of clinical sleep medicine : JCSM : official publication of the American Academy of Sleep Medicine | VOL. 17

Robust Speaker Identification System Based on Wavelet Transform and Gaussian Mixture Model
Wan-Chen Chen ... Ching-Tang Hsieh
-
Wan-Chen Chen, et. al.Wan-Chen Chen ... Ching-Tang Hsieh
01 Jan 2004
01 Jan 2004

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Pattern classification models for classifying and indexing audio signals

Abstract

Talk to us

Similar Papers

More From: Engineering Applications of Artificial Intelligence