Abstract
In this work, we compare emotion recognition on two types of speech: spontaneous and acted dialogues. Experiments were conducted on the AVEC2012 database of spontaneous dialogues and the IEMOCAP database of acted dialogues. We studied the performance of two types of acoustic features for emotion recognition: knowledge-inspired disfluency and nonverbal vocalisation (DIS-NV) features, and statistical Low-Level Descriptor (LLD) based features. Both Support Vector Machine (SVM) and Long Short-Term Memory Recurrent Neural Network (LSTM-RNN) models were built with each feature set on each emotional database. Our work aims to identify aspects of the data that constrain the effectiveness of models and features. Our results show that the performance of different types of features and models is influenced by the type of dialogue and the amount of training data. Because DIS-NVs are less frequent in acted dialogues than in spontaneous dialogues, the DIS-NV features outperform the LLD features when recognising emotions in spontaneous dialogues, but not in acted dialogues. The LSTM-RNN model outperforms the SVM model when enough training data is available, but its more complex structure may limit performance and increase the risk of overfitting when training data is scarce. Additionally, we find that long-distance context may be more useful for emotion recognition at the word level than at the utterance level.
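As a purely illustrative sketch (not the authors' implementation, features, or data), the modelling comparison described above can be pictured as follows. The EmotionLSTM class, the feature dimensionality, and the synthetic data are hypothetical placeholders; the sketch only shows how an utterance-level SVM on pooled features differs structurally from an LSTM-RNN that consumes word-level feature sequences, using scikit-learn and PyTorch.

```python
# Illustrative sketch only: comparing an utterance-level SVM with an LSTM
# sequence model on pre-extracted acoustic feature vectors.
# All dimensions and data below are hypothetical placeholders.
import numpy as np
import torch
import torch.nn as nn
from sklearn.svm import SVC
from sklearn.metrics import accuracy_score

rng = np.random.default_rng(0)

# Hypothetical data: 200 utterances, each a sequence of 50 word-level
# feature vectors (e.g. a 5-dimensional DIS-NV-style vector per word),
# labelled with one of 4 emotion classes.
X_seq = rng.normal(size=(200, 50, 5)).astype(np.float32)
y = rng.integers(0, 4, size=200)

# SVM baseline: collapse each sequence to a single utterance-level vector
# (mean pooling), since the SVM has no notion of temporal order.
X_flat = X_seq.mean(axis=1)
svm = SVC(kernel="rbf").fit(X_flat[:150], y[:150])
print("SVM accuracy:", accuracy_score(y[150:], svm.predict(X_flat[150:])))

# LSTM-RNN: consumes the word-level sequence directly, so long-distance
# context can influence the utterance-level prediction.
class EmotionLSTM(nn.Module):
    def __init__(self, n_features=5, hidden=32, n_classes=4):
        super().__init__()
        self.lstm = nn.LSTM(n_features, hidden, batch_first=True)
        self.out = nn.Linear(hidden, n_classes)

    def forward(self, x):
        _, (h, _) = self.lstm(x)   # final hidden state summarises the sequence
        return self.out(h[-1])

model = EmotionLSTM()
optimiser = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()
xb = torch.from_numpy(X_seq[:150])
yb = torch.from_numpy(y[:150]).long()
for _ in range(20):                # a few epochs on the toy data
    optimiser.zero_grad()
    loss_fn(model(xb), yb).backward()
    optimiser.step()

with torch.no_grad():
    preds = model(torch.from_numpy(X_seq[150:])).argmax(dim=1).numpy()
print("LSTM accuracy:", accuracy_score(y[150:], preds))
```

In practice, the choice between the two model families is governed by the factors discussed in the abstract: the amount of available training data and whether the dialogue is spontaneous or acted.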