Improving emotion recognition using class-level spectral features

Dmitri Bitouk,Ani Nenkova,Ragini Verma

doi:10.21437/interspeech.2009-582

Abstract

Traditional approaches to automatic emotion recognition from speech typically make use of utterance level prosodic features. Still, a great deal of useful information about expressivity and emotion can be gained from segmental spectral features, which provide a more detailed description of the speech signal, or from measurements from specific regions of the utterance, such as the stressed vowels. Here we introduce a novel set of spectral features for emotion recognition: statistics of Mel-Frequency Spectral Coefficients computed over three phoneme type classes of interest: stressed vowels, unstressed vowels and consonants in the utterance. We investigate performance of our features in the task of speaker-independent emotion recognition using two publicly available datasets. Our experimental results clearly indicate that indeed both the richer set of spectral features and the differentiation between phoneme type classes are beneficial for the task. Classification accuracies are consistently higher for our features compared to prosodic features or utterance-level spectral features. Combination of our phoneme class features with prosodic features leads to even further improvement. Index Terms: emotion recognition

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Improving emotion recognition using class-level spectral features

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Class-level spectral features for emotion recognition
Dmitri Bitouk ... Ani Nenkova
Speech Communication | VOL. 52
Dmitri Bitouk, et. al.Dmitri Bitouk ... Ani Nenkova
23 Feb 2010
Speech Communication | VOL. 52

Discrimination Capability of Prosodic and Spectral Features for Emotional Speech Recognition
...
Electronics and Electrical Engineering | VOL. 18
, et. al. ...
12 Nov 2012
Electronics and Electrical Engineering | VOL. 18

A Hybrid Speech Emotion Recognition System Based on Spectral and Prosodic Features
Yu Zhou ... Junfeng Li
IEICE Transactions on Information and Systems | VOL. E93-D
Yu Zhou, et. al.Yu Zhou ... Junfeng Li
01 Jan 2009
IEICE Transactions on Information and Systems | VOL. E93-D

Speech Emotion Recognition Using Both Spectral and Prosodic Features
Yu Zhou ... Jianping Zhang
-
Yu Zhou, et. al.Yu Zhou ... Jianping Zhang
01 Dec 2009
01 Dec 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improving emotion recognition using class-level spectral features

Abstract

Talk to us

Similar Papers