Cross-corpus speech emotion recognition using subspace learning and domain adaption

Xuan Cao, Maoshen Jia, Tun-Wen Pai, Jiawei Ru

doi:10.1186/s13636-022-00264-5

Abstract

Speech emotion recognition (SER) is an active research topic in speech signal processing. When the training data and the test data come from different corpora, their feature distributions differ, which degrades recognition performance. To address this problem, this paper proposes a cross-corpus speech emotion recognition method based on subspace learning and domain adaptation. Specifically, the training set and the test set form the source domain and the target domain, respectively. The Hessian matrix is then introduced to obtain a subspace for the features extracted in both domains. In addition, an information entropy-based domain adaptation method is introduced to construct a common space in which the difference between the source-domain and target-domain feature distributions is reduced as much as possible. The proposed method is evaluated in extensive cross-corpus speech emotion recognition experiments. The results show that it outperforms several existing subspace learning and domain adaptation methods.
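The general idea of mapping source and target features into a shared space can be illustrated with a much simpler stand-in technique. The sketch below uses PCA-based subspace alignment, which is not the Hessian-based subspace learning or the entropy-based adaptation described in the abstract. The function names, the subspace dimension `k`, and the random features are all assumptions made for illustration only.

```python
import numpy as np


def pca_basis(X, k):
    """Return the top-k principal directions of X as a (d, k) matrix."""
    Xc = X - X.mean(axis=0)
    # Rows of Vt are principal directions, ordered by decreasing singular value.
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Vt[:k].T


def subspace_align(Xs, Xt, k):
    """Project source and target features into an aligned k-dim space.

    Illustrative stand-in only: plain PCA subspace alignment, not the
    method proposed in the paper.
    """
    Ps = pca_basis(Xs, k)  # source subspace basis, shape (d, k)
    Pt = pca_basis(Xt, k)  # target subspace basis, shape (d, k)
    M = Ps.T @ Pt          # (k, k) matrix rotating source basis toward target
    Zs = (Xs - Xs.mean(axis=0)) @ Ps @ M
    Zt = (Xt - Xt.mean(axis=0)) @ Pt
    return Zs, Zt


# Hypothetical data: 40 source and 30 target utterances, 6 features each.
rng = np.random.default_rng(0)
Xs = rng.normal(size=(40, 6))
Xt = rng.normal(size=(30, 6)) + 2.0  # shifted to mimic a corpus mismatch
Zs, Zt = subspace_align(Xs, Xt, k=2)
print(Zs.shape, Zt.shape)
```

A classifier trained on `Zs` with the source labels could then be applied to `Zt`. Whether this reduces the mismatch on real corpora depends on the features and on the choice of `k`.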

Journal: EURASIP Journal on Audio, Speech, and Music Processing
Publication Date: Dec 27, 2022
Citations: 2
License type: open-access
