Abstract

The automatic estimation of human affect from the speech signal is an important step towards making virtual agents more natural and human-like. In this paper, we present a novel technique for incremental recognition of the user's emotional state, applied in a sensitive artificial listener (SAL) system designed for socially competent human-machine communication. Our method exploits acoustic, linguistic, and long-range contextual information to continuously predict the current quadrant in a two-dimensional emotional space spanned by the dimensions valence and activation. The main system components are a hierarchical dynamic Bayesian network (DBN) for detecting linguistic keyword features and long short-term memory (LSTM) recurrent neural networks that model phoneme context and emotional history to predict the affective state of the user. Experimental evaluations on the SAL corpus of non-prototypical, real-life emotional speech data consider several variants of our recognition framework: continuous emotion estimation from low-level feature frames is evaluated as a new alternative to the common approach of computing statistical functionals over entire speech turns. Further performance gains are achieved by discriminatively training the LSTM networks and by using bidirectional context information, leading to a quadrant prediction F1-measure of up to 51.3 %, which is only 7.6 % below the average inter-labeler consistency.
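To make the frame-level modeling idea concrete, the following is a minimal sketch of a bidirectional LSTM that maps a sequence of acoustic low-level descriptor frames to one of the four valence-activation quadrants per frame. It is an illustration under assumptions, not the paper's implementation: the feature dimensionality, layer sizes, and the use of PyTorch are all hypothetical, and the DBN keyword detector and emotional-history modeling of the full framework are not reproduced here.

    # Minimal sketch (not the paper's system): a bidirectional LSTM that
    # assigns one of the four valence-activation quadrants to every frame
    # of an utterance. Feature dimension and hyperparameters are assumed.
    import torch
    import torch.nn as nn

    N_FEATURES = 39   # assumed per-frame acoustic feature dimension
    N_QUADRANTS = 4   # valence/activation: (+,+), (+,-), (-,+), (-,-)

    class QuadrantBLSTM(nn.Module):
        def __init__(self, n_features=N_FEATURES, hidden=128):
            super().__init__()
            # The bidirectional layer supplies past *and* future context,
            # mirroring the "bidirectional context information" variant.
            self.blstm = nn.LSTM(n_features, hidden, batch_first=True,
                                 bidirectional=True)
            self.out = nn.Linear(2 * hidden, N_QUADRANTS)

        def forward(self, x):
            # x: (batch, time, n_features) -> logits: (batch, time, 4)
            h, _ = self.blstm(x)
            return self.out(h)

    if __name__ == "__main__":
        model = QuadrantBLSTM()
        frames = torch.randn(2, 100, N_FEATURES)   # two dummy utterances
        logits = model(frames)                     # per-frame quadrant scores
        # Discriminative training would minimize cross-entropy against
        # per-frame quadrant labels:
        labels = torch.randint(0, N_QUADRANTS, (2, 100))
        loss = nn.CrossEntropyLoss()(logits.reshape(-1, N_QUADRANTS),
                                     labels.reshape(-1))
        print(logits.shape, float(loss))

For strictly incremental (online) operation, a unidirectional LSTM (bidirectional=False) would be used instead, trading the future context exploited above for lower latency.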
