Abstract

Emotion plays a significant role in identifying the state of a speaker from spoken utterances. Prosodic features add meaning to spoken utterances and convey the speaker's emotions. The objective of this research is to analyze the behavior of prosodic features (individually and in combination with other prosodic features) with different learning classifiers on emotion-based utterances of children in the Urdu language. In this paper, three prosodic features (intensity, pitch, and formant, and their combinations), five learning classifiers (ANN, J-48, K-star, Naïve Bayes, and decision stump), and four basic emotions (happy, sad, angry, and neutral) were used to develop the experimental framework. The experiments show that, in terms of classification accuracy, artificial neural networks show significant results with both individual and combined prosodic features in comparison with the other learning classifiers.
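The size of the experimental framework described above can be illustrated with a minimal sketch. It assumes that "combinations" means every non-empty subset of the three prosodic features; the abstract does not state which combinations were actually evaluated. The names `feature_sets` and `experiment_grid` are hypothetical and used only for illustration.

```python
from itertools import combinations

# Names taken from the abstract; the enumeration itself is an assumption.
FEATURES = ["intensity", "pitch", "formant"]
CLASSIFIERS = ["ANN", "J-48", "K-star", "Naive Bayes", "decision stump"]
EMOTIONS = ["happy", "sad", "angry", "neutral"]  # class labels for every run


def feature_sets(features):
    """Return every non-empty subset of the features, smallest subsets first."""
    return [
        combo
        for size in range(1, len(features) + 1)
        for combo in combinations(features, size)
    ]


def experiment_grid(features, classifiers):
    """Pair each feature subset with each classifier (hypothetical helper)."""
    return [(fs, clf) for fs in feature_sets(features) for clf in classifiers]


grid = experiment_grid(FEATURES, CLASSIFIERS)
print(len(feature_sets(FEATURES)), "feature sets")
print(len(grid), "feature-set/classifier configurations")
```

Under this assumption there are 7 feature sets (3 individual, 3 pairs, 1 triple) and 35 configurations, each of which would be scored on the four emotion classes.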

Highlights

  • Speech is commonly known as an effective way of communication between human beings [1]

  • The extraction process of prosodic features from spoken emotion utterances in the regional language Urdu is shown in Figures 1 to 3

  • These observations demonstrate the behavior of all three prosodic features, which were extracted using the PRAAT software [1]

Summary

Introduction and Related Work

Speech is commonly known as an effective way of communication between human beings [1]. A review and analysis of speech emotion recognition using different learning classifiers is presented in [8]. The extraction of both local and global prosodic features from sentences is addressed in [9], where different words and syllables are suggested for analyzing speech recognition or affect recognition. Five different emotions associated with the acoustic properties of speech prosody are investigated in [11]. That investigation finds that speech associated with the emotions “love” and “sad” is identified by higher pitch and by utterances of longer duration. As in [11], prosody is recognized in [12] as the most fundamental feature of emotional expression in speech.
