Abstract

Speech emotion recognition (SER) is an active research area in speech signal processing. With the increasing availability of cheap computing power and the proliferation of data-driven methods, deep learning approaches have become the prominent solutions for SER. Nevertheless, SER remains a challenging task due to the scarcity of datasets and the absence of explicit emotion-perception mechanisms in existing models. Most existing SER networks are adapted from computer vision and natural language processing architectures, so they are not well suited to extracting emotional information from speech. Drawing on findings from brain science on affective computing and inspired by the emotional perception process of the human brain, we propose an approach based on emotional perception, which designs a human-like implicit emotional attribute classification task and introduces implicit emotional information through multi-task learning. Preliminary experiments show that the proposed method improves unweighted accuracy (UA) by 2.44% and weighted accuracy (WA) by 3.18% (both absolute improvements) on the Interactive Emotional Dyadic Motion Capture (IEMOCAP) dataset, which verifies the effectiveness of our method.
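The abstract does not detail the architecture, but a minimal sketch of the multi-task setup it describes might look as follows: a shared acoustic encoder feeding two classification heads, one for the main emotion classes and one for the auxiliary implicit emotional attributes. Everything here (the LSTM encoder, the feature and hidden dimensions, the number of attribute classes, and the loss weight `alpha`) is an illustrative assumption, not the authors' implementation.

```python
import torch
import torch.nn as nn

class MultiTaskSER(nn.Module):
    """Shared encoder with two heads: main emotion classification
    and auxiliary implicit emotional attribute classification."""

    def __init__(self, feat_dim=40, hidden_dim=128,
                 n_emotions=4, n_attributes=3):
        super().__init__()
        # Shared encoder over frame-level acoustic features
        # (batch, time, feat_dim).
        self.encoder = nn.LSTM(feat_dim, hidden_dim, batch_first=True)
        # Main task head, e.g. the 4 commonly used IEMOCAP classes.
        self.emotion_head = nn.Linear(hidden_dim, n_emotions)
        # Auxiliary head for the implicit emotional attributes.
        self.attribute_head = nn.Linear(hidden_dim, n_attributes)

    def forward(self, x):
        _, (h, _) = self.encoder(x)   # h: (1, batch, hidden_dim)
        shared = h.squeeze(0)         # utterance-level embedding
        return self.emotion_head(shared), self.attribute_head(shared)

def multitask_loss(emo_logits, attr_logits, emo_y, attr_y, alpha=0.3):
    """Joint objective: main emotion loss plus a weighted
    auxiliary attribute loss, as in standard multi-task learning."""
    ce = nn.functional.cross_entropy
    return ce(emo_logits, emo_y) + alpha * ce(attr_logits, attr_y)
```

In this kind of setup, the auxiliary attribute task only shapes the shared representation during training; at inference, predictions are taken from the emotion head alone.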
