Speech Emotion Recognition Using Semantic Information

Panagiotis Tzirakis,Anh Nguyen,Bjorn W Schuller,Stefanos Zafeiriou

doi:10.1109/icassp39728.2021.9414866

Abstract

Speech emotion recognition is a crucial problem manifesting in a multitude of applications such as human computer interaction and education. Although several advancements have been made in the recent years, especially with the advent of Deep Neural Networks (DNN), most of the studies in the literature fail to consider the semantic information in the speech signal. In this paper, we propose a novel framework that can capture both the semantic and the paralinguistic information in the signal. In particular, our framework is comprised of a semantic feature extractor, that captures the semantic information, and a paralinguistic feature extractor, that captures the paralinguistic information. Both semantic and paraliguistic features are then combined to a unified representation using a novel attention mechanism. The unified feature vector is passed through a LSTM to capture the temporal dynamics in the signal, before the final prediction. To validate the effectiveness of our framework, we use the popular SEWA dataset of the AVEC challenge series and compare with the three winning papers. Our model provides state-of-the-art results in the valence and liking dimensions. <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">1</sup>

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Speech Emotion Recognition Using Semantic Information

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Progress in speech emotion recognition
Xueying Zhang ... Shufei Duan
-
Xueying Zhang, et. al.Xueying Zhang ... Shufei Duan
01 Nov 2015
01 Nov 2015

In-depth investigation of speech emotion recognition studies from past to present –The importance of emotion recognition from speech signal for AI–
Yeşim Ülgen Sönmez ... Asaf Varol
Intelligent Systems with Applications | VOL. 22
Yeşim Ülgen Sönmez, et. al.Yeşim Ülgen Sönmez ... Asaf Varol
11 Mar 2024
Intelligent Systems with Applications | VOL. 22

Artificial bandwidth extension using [formula omitted] sampled-data control theory
Deepika Gupta ... Hanumant Singh Shekhawat
Speech Communication | VOL. 134
Deepika Gupta, et. al.Deepika Gupta ... Hanumant Singh Shekhawat
06 Sep 2021
Speech Communication | VOL. 134

Speech Signal Imaging and Emotion Recognition Based on Symmetric-Diagonal Matrix Model
Zijun Yang ... Aoran Xi
-
Zijun Yang, et. al.Zijun Yang ... Aoran Xi
01 Jan 2023
01 Jan 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Speech Emotion Recognition Using Semantic Information

Abstract

Talk to us

Similar Papers