Speech emotion recognition with acoustic and lexical features

Qin Jin, Huimin Wu, Shizhe Chen, Chengxin Li

doi:10.1109/icassp.2015.7178872

Abstract

In this paper, we explore one of the key aspects of building an emotion recognition system: generating suitable feature representations. We generate feature representations at both the acoustic and lexical levels. At the acoustic level, we first extract low-level features such as intensity, F0, jitter, shimmer, and spectral contours. We then generate different acoustic feature representations based on these low-level features, including statistics over these features, a new representation derived from a set of low-level acoustic codewords, and a new representation from Gaussian Supervectors. At the lexical level, we propose a new feature representation named emotion vector (eVector). We also use the traditional Bag-of-Words (BoW) feature. We apply these feature representations to emotion recognition and compare their performance on the USC-IEMOCAP database. We also combine these different feature representations via early fusion and late fusion. Our experimental results show that late fusion of both acoustic and lexical features achieves a four-class emotion recognition accuracy of 69.2%.
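To make the distinction between the two fusion strategies concrete, here is a minimal sketch in Python. It is not the authors' implementation: the abstract does not say how the fusion is computed, so the sketch assumes the common reading in which early fusion concatenates feature vectors before classification and late fusion takes a weighted average of per-class scores from separately trained classifiers. The label set, the function names, and the `acoustic_weight` parameter are all hypothetical.

```python
from typing import Dict, List, Sequence

# Hypothetical label set: the abstract only says "four-class".
LABELS = ["angry", "happy", "neutral", "sad"]


def early_fusion(acoustic: Sequence[float], lexical: Sequence[float]) -> List[float]:
    """Feature-level fusion: join both feature vectors into one input vector."""
    return list(acoustic) + list(lexical)


def late_fusion(
    acoustic_scores: Dict[str, float],
    lexical_scores: Dict[str, float],
    acoustic_weight: float = 0.5,
) -> Dict[str, float]:
    """Decision-level fusion: weighted average of per-class scores.

    Assumes each classifier has already produced one score per label.
    """
    if not 0.0 <= acoustic_weight <= 1.0:
        raise ValueError("acoustic_weight must lie in [0, 1]")
    lexical_weight = 1.0 - acoustic_weight
    return {
        label: acoustic_weight * acoustic_scores[label]
        + lexical_weight * lexical_scores[label]
        for label in LABELS
    }


def predict(scores: Dict[str, float]) -> str:
    """Return the label with the highest fused score."""
    return max(scores, key=scores.get)
```

In this sketch, `early_fusion` would feed a single classifier trained on the joined vector, while `late_fusion` combines the outputs of one acoustic and one lexical classifier. The weight would normally be tuned on held-out data.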

Similar Papers

Speaker-dependent low-level acoustic feature extraction for emotion recognition
Tejal Udhan ... Shonda Bernadin
The Journal of the Acoustical Society of America | VOL. 143
01 Mar 2018

Affective Latent Representation of Acoustic and Lexical Features for Emotion Recognition.
Eesung Kim ... Hyungchan Song
Sensors | VOL. 20
04 May 2020

Emotion Recognition Combining Acoustic and Linguistic Features Based on Speech Recognition Results
Misaki Sakurai ... Tetsuo Kosaka
12 Oct 2021

Speech emotion classification using acoustic features
Shizhe Chen ... Qin Jin
01 Sep 2014
