Audio Classification Based on MPEG-7 Spectral Basis Representations

H.-G Kim,T Sikora,N Moreau

doi:10.1109/tcsvt.2004.826766

Abstract

In this paper, we present an MPEG-7-based audio classification and retrieval technique targeted for analysis of film material. The technique consists of low-level descriptors and high-level description schemes. For low-level descriptors, low-dimensional features such as audio spectrum projection based on audio spectrum basis descriptors is produced in order to find a balanced tradeoff between reducing dimensionality and retaining maximum information content. High-level description schemes are used to describe the modeling of reduced-dimension features, the procedure of audio classification, and retrieval. A classifier based on continuous hidden Markov models is applied. The sound model state path, which is selected according to the maximum-likelihood model, is stored in an MPEG-7 sound database and used as an index for query applications. Various experiments are presented where the speaker- and sound-recognition rates are compared for different feature extraction methods. Using independent component analysis, we achieved better results than normalized audio spectrum envelope and principal component analysis in a speaker recognition system. In audio classification experiments, audio sounds are classified into selected sound classes in real time with an accuracy of 96%.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Audio Classification Based on MPEG-7 Spectral Basis Representations

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems for Video Technology

Lead the way for us

Journal: IEEE Transactions on Circuits and Systems for Video Technology	Publication Date: May 1, 2004
Citations: 97

Similar Papers

Speaker recognition using MPEG-7 descriptors
Hyoung-Gook Kim ... Nicolas Moreau
-
Hyoung-Gook Kim, et. al.Hyoung-Gook Kim ... Nicolas Moreau
01 Sep 2003
01 Sep 2003

Generalized multi-stream hidden Markov models.
Oualid Missaoui
-
Oualid MissaouiOualid Missaoui
12 Feb 2015
12 Feb 2015

Home environmental sound recognition based on MPEG-7 features
Jhing-Fa Wang ... Tze-Hsuan Huang
-
Jhing-Fa Wang, et. al. Jhing-Fa Wang ... Tze-Hsuan Huang
27 Dec 2003
27 Dec 2003

A Comparative Study of Feature Extraction and Classification Methods for Military Vehicle Type Recognition Using Acoustic and Seismic Signals
Hanguang Xiao ... Xinghua Liu
-
Hanguang Xiao, et. al.Hanguang Xiao ... Xinghua Liu
21 Aug 2007
21 Aug 2007

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Audio Classification Based on MPEG-7 Spectral Basis Representations

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems for Video Technology