Emotion Recognition Combining Acoustic and Linguistic Features Based on Speech Recognition Results

Misaki Sakurai, Tetsuo Kosaka

doi:10.1109/gcce53005.2021.9621810

Abstract

This study investigates a speech emotion recognition method that uses both acoustic and linguistic features. Various emotion recognition methods that combine these two feature types have been proposed. However, most studies that use linguistic features rely on reference transcripts, because recognizing emotional speech is considered more difficult than recognizing non-emotional speech: the acoustic features of emotional speech differ from those of non-emotional speech and vary greatly with emotion type and intensity. We have been developing an emotional speech recognition method that combines acoustic model adaptation with language model adaptation, and it has achieved high recognition performance on an emotional speech task. In this study, we extract linguistic features from speech recognition results rather than from reference transcripts. The word recognition accuracy of the system was 82.2%, so the recognition results contained errors. Despite these errors, the linguistic features extracted from the recognition results proved useful, and we demonstrate that combining linguistic and acoustic features is effective for emotion recognition.
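For readers unfamiliar with the metric, word recognition accuracy is conventionally defined as (N - S - D - I) / N, where N is the number of words in the reference transcript and S, D, and I are the substitutions, deletions, and insertions in the recognizer output. The following is a minimal sketch of that standard definition, not the authors' scoring code; the function name `word_accuracy` and the example sentences are illustrative assumptions.

```python
def word_accuracy(reference: str, hypothesis: str) -> float:
    """Return (N - S - D - I) / N using word-level Levenshtein distance.

    Illustrative helper (hypothetical name); whitespace tokenization is an
    assumption and would differ for languages without word delimiters.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between ref[:i-1] and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i in range(1, len(ref) + 1):
        curr = [i] + [0] * len(hyp)
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            curr[j] = min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr

    errors = prev[len(hyp)]  # minimal S + D + I
    return (len(ref) - errors) / len(ref)


if __name__ == "__main__":
    # One substitution in four reference words.
    print(word_accuracy("a b c d", "a x c d"))
```

Because insertions count as errors, this value can fall below zero when the hypothesis is much longer than the reference.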

Similar Papers

Speech Signal Imaging and Emotion Recognition Based on Symmetric-Diagonal Matrix Model
Zijun Yang ... Aoran Xi
01 Jan 2023

A Deep Learning Method Using Gender-Specific Features for Emotion Recognition.
Li-Min Zhang ... Giap Weng Ng
Sensors | VOL. 23
25 Jan 2023

EdgeRNN: A Compact Speech Recognition Network With Spatio-Temporal Features for Edge Computing
Shunzhi Yang ... Kai Ye
IEEE Access | VOL. 8
01 Jan 2020

Performance comparison of speaker and emotion recognition
A Revathy ... V Mohan
01 Mar 2015
