Abstract

Audio content classification is an interesting and significant issue. Audio classification technique has two basic parts: audio feature extraction and classifier. In general the audio content classification method is firstly to identify the original audio into text, then use the identified text to classify. But the text recognition rate is not high, some words that good for classification are identified by mistake causing that the classification effect is not ideal. In order to solve these problems above, this paper proposes a new effective audio classification method based on two-step strategy. In the first step the features are extracted by using the improved mutual information and classified with Naïve Bayes classifier. After classification of the first step, an unreliable area is determined, and samples with features in this area go on to be classified with the second step. In the second step, textual features extracted with CHI statistic method are used to build a text feature space model. Then audio features containing MFCC and frame energy are combined together with the text features to build a new feature vector space model. Finally, the new feature vector space model is classified using Support Vector Machine (SVM) classifier. The experiments show that the two-step strategy classification method for audio classification achieves great classification performance with the accuracy rate of 97.2%.

Highlights

  • With the high-speed development of information industry, the digital information grows rapidly

  • To aim of solving the above problems, we proposed a new audio classification method based on two-step strategy www.ijacsa.thesai.org (IJACSA) International Journal of Advanced Computer Science and Applications, Vol 5, No 3, 2014 combining text features and audio features

  • In the second step it combines audio features Mel frequency cepstrum coefficient (MFCC), frame energy with text features selected with CHI statistic formula as the total classify features, uses Support Vector Machine classification method to classify

Read more

Summary

INTRODUCTION

With the high-speed development of information industry, the digital information grows rapidly. Whether the texts after identification are the same as plain texts with the phenomena that most misclassified ones are in a fuzzy region constituted in the twodimensional space structure Whether audio features such as MFCC, frame power could be effective for audio content classification. Text features of texts identified from audio are fewer than features of the plain texts, and especially some words with excellent contribution to the classification are recognized by mistake These problems cause the classification effect in the first step is not ideal. In the second step it combines audio features MFCC, frame energy with text features selected with CHI statistic formula as the total classify features, uses Support Vector Machine classification method to classify. The final classification judgment is given according to the results of the two steps

CHARACTERISTIC ANALYSIS IN THE AUDIO CONTENT CLASSIFICATION
Frame Energy
Mel Frequency Cepstrum Coefficient
The Rewriting for two types of Naïve Bayes Classifier
The design of the Support Vector Machine SVM Classifier
The observations of misclassification samples in the first step
Evaluation index
The characteristics of the two-step classification method
The Experiment Database
Classification Method
CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call