Audio Content Classification Method Research Based on Two-step Strategy

Sumei Liang,Xinhua Fan

doi:10.14569/ijacsa.2014.050307

Abstract

Audio content classification is an interesting and significant issue. Audio classification technique has two basic parts: audio feature extraction and classifier. In general the audio content classification method is firstly to identify the original audio into text, then use the identified text to classify. But the text recognition rate is not high, some words that good for classification are identified by mistake causing that the classification effect is not ideal. In order to solve these problems above, this paper proposes a new effective audio classification method based on two-step strategy. In the first step the features are extracted by using the improved mutual information and classified with Naïve Bayes classifier. After classification of the first step, an unreliable area is determined, and samples with features in this area go on to be classified with the second step. In the second step, textual features extracted with CHI statistic method are used to build a text feature space model. Then audio features containing MFCC and frame energy are combined together with the text features to build a new feature vector space model. Finally, the new feature vector space model is classified using Support Vector Machine (SVM) classifier. The experiments show that the two-step strategy classification method for audio classification achieves great classification performance with the accuracy rate of 97.2%.

Highlights

With the high-speed development of information industry, the digital information grows rapidly
To aim of solving the above problems, we proposed a new audio classification method based on two-step strategy www.ijacsa.thesai.org (IJACSA) International Journal of Advanced Computer Science and Applications, Vol 5, No 3, 2014 combining text features and audio features
In the second step it combines audio features Mel frequency cepstrum coefficient (MFCC), frame energy with text features selected with CHI statistic formula as the total classify features, uses Support Vector Machine classification method to classify

Summary

INTRODUCTION

With the high-speed development of information industry, the digital information grows rapidly. Whether the texts after identification are the same as plain texts with the phenomena that most misclassified ones are in a fuzzy region constituted in the twodimensional space structure Whether audio features such as MFCC, frame power could be effective for audio content classification. Text features of texts identified from audio are fewer than features of the plain texts, and especially some words with excellent contribution to the classification are recognized by mistake These problems cause the classification effect in the first step is not ideal. In the second step it combines audio features MFCC, frame energy with text features selected with CHI statistic formula as the total classify features, uses Support Vector Machine classification method to classify. The final classification judgment is given according to the results of the two steps

CHARACTERISTIC ANALYSIS IN THE AUDIO CONTENT CLASSIFICATION

Frame Energy

Mel Frequency Cepstrum Coefficient

The Rewriting for two types of Naïve Bayes Classifier

The design of the Support Vector Machine SVM Classifier

The observations of misclassification samples in the first step

Evaluation index

The characteristics of the two-step classification method

The Experiment Database

Classification Method

CONCLUSION

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal of Advanced Computer Science and Applications	Publication Date: Jan 1, 2014
Citations: 10	License type: cc-by

R Discovery Prime

R Discovery Prime

Audio Content Classification Method Research Based on Two-step Strategy

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Advanced Computer Science and Applications

Lead the way for us

Similar Papers

Hybrid Independent Component Analysis and Rough Set Approach for Audio Feature Extraction
Xin He ... Ling Guo
-
Xin He, et. al.Xin He ... Ling Guo
01 Oct 2008
01 Oct 2008

Research on sound classification based on SVM
Pengcheng Wei ... Jing Li
Neural Computing & Applications | VOL. 32
Pengcheng Wei, et. al.Pengcheng Wei ... Jing Li
16 Apr 2019
Neural Computing & Applications | VOL. 32

Comparison of SVM, KNN, and NB Classifier for Genre Music Classification based on Metadata
De Rosal Ignatius Moses Setiadi ... Candra Irawan
-
De Rosal Ignatius Moses Setiadi, et. al.De Rosal Ignatius Moses Setiadi ... Candra Irawan
19 Sep 2020
19 Sep 2020

Classification of COVID-19 Cough Sounds using Mel Frequency Cepstral Coefficient (MFCC) Feature Extraction and Support Vector Machine
Muhammad Meftah Mafazy ... Dwi Kartini
Telematika | VOL. 16
Muhammad Meftah Mafazy, et. al.Muhammad Meftah Mafazy ... Dwi Kartini
15 Aug 2023
Telematika | VOL. 16

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Audio Content Classification Method Research Based on Two-step Strategy

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Advanced Computer Science and Applications