Abstract

Feature extraction is the key to the object recognition. How to obtain effective, reliable characteristic parameters from the limited measured data is a question of great importance in feature extraction. This paper presents a method based on Empirical Mode Decomposition (EMD) for the extraction of Mel Frequency Cepstrum Coefficients (MFCCs) and its first order difference from original speech signals that contain four kinds of emotions such as anger, happiness, surprise and natural for emotion recognition. And the experiments compare the recognition rate of MFCC, differential MFCC (Both of them are extracted based on EMD) or their combination through using Support Vector Machine (SVM) to recognize speakers' emotional speech identity. It proves that the combination of MFCC and its first order difference has a highest recognition rate.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call