Abstract

Speech emotion recognition is one of the important research topics in the field of multimedia processing and human-machine interface. To obtain the most influential features of the speech data for emotion recognition, in this paper, 64 statistical features of the speech signal including short-term energy, pitch, frame, format, and spectrum energy were extracted with speech emotion database. Mean Impact Value (MIV) and the improved Correlation-based Feature Selection (CFS) were employed to select the most influential feature set. BP neural network (BPNN) was used to identify the accuracy. The proposed MIV-CFS method selected the features related to speech emotion, with less recognition error, the recognition accuracy all higher than 88%, and the highest recognition accuracy is 91.61%.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.