Abstract

This paper compares and analyzes speech emotion recognition in Arabic and English. Four emotions (neutral, sadness, happiness and anger) were considered from two speech corpora: the King Saud University Emotions (KSUEmotions) corpus for Arabic and the Emotional Prosody Speech and Transcripts (EPST) corpus for English. Six speakers (three men and three women) were selected from each corpus. A range of acoustic features was extracted for use in the recognition and analysis stages. An analysis of variance (ANOVA) was then used to determine which acoustic features should be used in our emotion recognition system. The results show that using specific acoustic features benefits emotion recognition for Arabic words. They also show that certain speech features, such as the first three formants, improve the accuracy of emotion recognition.
