Abstract

In this paper, we investigate the problem of cross-corpus speech emotion recognition (SER), in which the training (source) and testing (target) speech samples belong to different corpora. This setting leads to a feature distribution mismatch between the source and target speech samples, so the performance of most existing SER methods drops sharply. To solve this problem, we propose a simple yet effective transfer subspace learning method called joint distribution implicitly aligned subspace learning (JIASL). The basic idea of JIASL is straightforward: build an emotion-discriminative and corpus-invariant linear regression model under an implicit distribution alignment strategy. Following this idea, we first make use of the source speech features and emotion labels to endow the regression model with emotion-discriminative ability. Then, a well-designed reconstruction regularization term, which jointly considers the marginal and conditional distribution alignments between the speech samples of the two corpora, implicitly enables the regression model to predict the emotion labels of the target speech samples. To evaluate the proposed JIASL, we carry out extensive cross-corpus SER experiments, and the results demonstrate its promising performance on cross-corpus SER tasks.
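To make the overall recipe concrete, the sketch below shows a generic transfer-regression objective in the spirit described above: a ridge regression fitted on labeled source features, augmented with marginal and pseudo-label conditional alignment penalties between source and target. This is an illustrative assumption, not the paper's formulation; in particular, JIASL uses a reconstruction regularization term for alignment, whereas the sketch substitutes a simpler class-wise mean-matching penalty, and the alternating solver and function name are hypothetical.

```python
# Illustrative sketch (not the JIASL objective): linear regression on source
# features plus marginal and conditional (pseudo-label) alignment penalties.
import numpy as np

def fit_aligned_regressor(Xs, Ys, Xt, lam=1.0, gamma=1.0, n_iter=5):
    """Xs: (ns, d) source features, Ys: (ns, c) one-hot source labels,
    Xt: (nt, d) unlabeled target features. Returns projection W of shape (d, c)."""
    ns, d = Xs.shape
    c = Ys.shape[1]
    # Start from plain ridge regression on the source corpus.
    W = np.linalg.solve(Xs.T @ Xs + lam * np.eye(d), Xs.T @ Ys)
    for _ in range(n_iter):
        # Pseudo-label target samples with the current regressor; these labels
        # drive the conditional (class-wise) alignment term.
        Yt_pseudo = np.eye(c)[np.argmax(Xt @ W, axis=1)]
        # Marginal alignment: penalize the gap between projected source and
        # target means, encoded as a quadratic form in W.
        mu_diff = Xs.mean(0) - Xt.mean(0)
        M = np.outer(mu_diff, mu_diff)
        # Conditional alignment: penalize class-wise mean gaps under pseudo-labels.
        C = np.zeros((d, d))
        for k in range(c):
            s_k, t_k = Ys[:, k] > 0, Yt_pseudo[:, k] > 0
            if s_k.any() and t_k.any():
                diff = Xs[s_k].mean(0) - Xt[t_k].mean(0)
                C += np.outer(diff, diff)
        # Closed-form update of the regularized least-squares problem:
        # min_W ||Xs W - Ys||^2 + lam ||W||^2 + gamma * tr(W^T (M + C) W).
        A = Xs.T @ Xs + lam * np.eye(d) + gamma * (M + C)
        W = np.linalg.solve(A, Xs.T @ Ys)
    return W
```

Given a fitted `W`, target emotion labels would be predicted as `np.argmax(Xt @ W, axis=1)`; the alternation between pseudo-labeling and re-solving is one common way such conditional alignment terms are handled when target labels are unavailable.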
