Cross-Corpus Speech Emotion Recognition Based on Multi-Task Learning and Subdomain Adaptation.

Hongliang Fu,Wenzhuo Duan,Chen Huang,Yang Wang,Zhihao Zhuang

doi:10.3390/e25010124

Abstract

To solve the problem of feature distribution discrepancy in cross-corpus speech emotion recognition tasks, this paper proposed an emotion recognition model based on multi-task learning and subdomain adaptation, which alleviates the impact on emotion recognition. Existing methods have shortcomings in speech feature representation and cross-corpus feature distribution alignment. The proposed model uses a deep denoising auto-encoder as a shared feature extraction network for multi-task learning, and the fully connected layer and softmax layer are added before each recognition task as task-specific layers. Subsequently, the subdomain adaptation algorithm of emotion and gender features is added to the shared network to obtain the shared emotion features and gender features of the source domain and target domain, respectively. Multi-task learning effectively enhances the representation ability of features, a subdomain adaptive algorithm promotes the migrating ability of features and effectively alleviates the impact of feature distribution differences in emotional features. The average results of six cross-corpus speech emotion recognition experiments show that, compared with other models, the weighted average recall rate is increased by 1.89~10.07%, the experimental results verify the validity of the proposed model.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Entropy	Publication Date: Jan 7, 2023
Citations: 5	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Cross-Corpus Speech Emotion Recognition Based on Multi-Task Learning and Subdomain Adaptation.

Abstract

Talk to us

Similar Papers

More From: Entropy

Lead the way for us

Similar Papers

Exploring corpus-invariant emotional acoustic feature for cross-corpus speech emotion recognition
Hailun Lian ... Yuan Zong
Expert Systems With Applications | VOL. 258
Hailun Lian, et. al.Hailun Lian ... Yuan Zong
22 Aug 2024
Expert Systems With Applications | VOL. 258

Speech emotion recognition system based on wavelet transform and multi-task learning
Yunxiang Liu ... Zhang Kexin
-
Yunxiang Liu, et. al.Yunxiang Liu ... Zhang Kexin
24 Nov 2022
24 Nov 2022

Multi-scale discrepancy adversarial network for cross-corpus speech emotion recognition
Wanlu Zheng ... Yuan Zong
Virtual Reality & Intelligent Hardware | VOL. 3
Wanlu Zheng, et. al.Wanlu Zheng ... Yuan Zong
01 Feb 2021
Virtual Reality & Intelligent Hardware | VOL. 3

Cross-corpus speech emotion recognition using subspace learning and domain adaption
Xuan Cao ... Jiawei Ru
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2022
Xuan Cao, et. al.Xuan Cao ... Jiawei Ru
27 Dec 2022
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Cross-Corpus Speech Emotion Recognition Based on Multi-Task Learning and Subdomain Adaptation.

Abstract

Talk to us

Similar Papers

More From: Entropy