Emotion recognition from speech using source, system, and prosodic features

Shashidhar G Koolagudi,K Sreenivasa Rao

doi:10.1007/s10772-012-9139-3

Abstract

In this work, source, system, and prosodic features of speech are explored for characterizing and classifying the underlying emotions. Different speech features contribute in different ways to express the emotions, due to their complementary nature. Linear prediction residual samples chosen around glottal closure regions, and glottal pulse parameters are used to represent excitation source information. Linear prediction cepstral coefficients extracted through simple block processing and pitch synchronous analysis represent the vocal tract information. Global and local prosodic features extracted from gross statistics and temporal dynamics of the sequence of duration, pitch, and energy values represent the prosodic information. Emotion recognition models are developed using above mentioned features separately, and in combination. Simulated Telugu emotion database (IITKGP-SESC) is used to evaluate the proposed features. The emotion recognition results obtained using IITKGP-SESC are compared with the results of internationally known Berlin emotion speech database (Emo-DB). Autoassociative neural networks, Gaussian mixture models, and support vector machines are used to develop emotion recognition systems with source, system, and prosodic features, respectively. Weighted combination of evidence has been used while combining the performance of systems developed using different features. From the results, it is observed that, each of the proposed speech features has contributed toward emotion recognition. The combination of features improved the emotion recognition performance, indicating the complementary nature of the features.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Emotion recognition from speech using source, system, and prosodic features

Abstract

Talk to us

Similar Papers

More From: International Journal of Speech Technology

Lead the way for us

Journal: International Journal of Speech Technology	Publication Date: Mar 20, 2012
Citations: 107

Similar Papers

How does real affect affect affect recognition in speech?
Khiet Truong
-
Khiet TruongKhiet Truong
12 May 2017
12 May 2017

A Hybrid Speech Emotion Recognition System Based on Spectral and Prosodic Features
Yu Zhou ... Junfeng Li
IEICE Transactions on Information and Systems | VOL. E93-D
Yu Zhou, et. al.Yu Zhou ... Junfeng Li
01 Jan 2009
IEICE Transactions on Information and Systems | VOL. E93-D

Improvement of phone recognition accuracy using speech mode classification
Kumud Tripathi ... K Sreenivasa Rao
International Journal of Speech Technology | VOL. 21
Kumud Tripathi, et. al.Kumud Tripathi ... K Sreenivasa Rao
07 Dec 2017
International Journal of Speech Technology | VOL. 21

Emotion recognition using LP residual
Arun Chauhan ... K Sreenivasa Rao
-
Arun Chauhan, et. al. Arun Chauhan ... K Sreenivasa Rao
01 Apr 2010
01 Apr 2010

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Emotion recognition from speech using source, system, and prosodic features

Abstract

Talk to us

Similar Papers

More From: International Journal of Speech Technology