Transferable discriminant linear regression for cross-corpus speech emotion recognition

Shaokai Li, Peng Song, Wenjing Zhang

doi:10.1016/j.apacoust.2022.108919

Abstract

Speech emotion recognition (SER) has attracted much interest recently owing to its wide range of applications. However, most SER methods assume that the training and testing data come from the same database. In real applications this assumption often does not hold, and recognition performance degrades significantly. To solve this problem, we present a novel transferable discriminant linear regression (TDLR) approach for cross-corpus SER. First, we introduce a non-negative label relaxation linear regression on the source corpus to help learn transferable feature representations. Second, we propose a simple but effective strategy to preserve the linear relationship between the labels of the source and target corpora. Meanwhile, we use the discriminative maximum mean discrepancy (MMD) as the distance metric between the two databases. Furthermore, we use the graph Laplacian to preserve the geometric structure of the samples, which further reduces the distribution gap between the two databases. Additionally, to better capture the intrinsic properties of the data and make the model robust, we impose an ℓ2,1-norm on the transformation matrices. Extensive experiments on several standard databases show that TDLR achieves better recognition performance than several state-of-the-art algorithms.
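
The abstract names three generic building blocks: the ℓ2,1-norm, the maximum mean discrepancy, and the graph Laplacian. The sketch below illustrates their textbook definitions in NumPy. It is not the TDLR objective from the paper: the function names are hypothetical, the MMD shown is the plain linear-kernel form (squared distance between sample means) rather than the discriminative variant the authors use, and the Laplacian is the unnormalized one built from an affinity matrix assumed to be given.

```python
import numpy as np


def l21_norm(W):
    """l2,1-norm: sum of the Euclidean norms of the rows of W."""
    return float(np.sum(np.linalg.norm(W, axis=1)))


def linear_mmd(Xs, Xt):
    """Squared distance between the mean feature vectors of two sample
    sets (rows are samples). This is the empirical MMD with a linear
    kernel, an assumption made here for simplicity."""
    diff = Xs.mean(axis=0) - Xt.mean(axis=0)
    return float(diff @ diff)


def graph_laplacian(A):
    """Unnormalized graph Laplacian L = D - A for a symmetric
    affinity matrix A, where D is the diagonal degree matrix."""
    return np.diag(A.sum(axis=1)) - A
```

Because the ℓ2,1-norm sums row norms, penalizing it tends to drive whole rows of a transformation matrix toward zero. That row sparsity is the usual motivation for using it as a regularizer.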

Journal: Applied Acoustics
Publication Date: Jul 25, 2022
Citations: 7

Similar Papers

Cross-Corpus Speech Emotion Recognition Based on Joint Transfer Subspace Learning and Regression
Weijian Zhang ... Chao Sheng
IEEE Transactions on Cognitive and Developmental Systems | VOL. 14
01 Jun 2022

Latent sparse transfer subspace learning for cross-corpus facial expression recognition
Wenjing Zhang ... Weijian Zhang
Digital Signal Processing | VOL. 116
04 Jun 2021

Progressively Discriminative Transfer Network for Cross-Corpus Speech Emotion Recognition
Cheng Lu ... Chuangao Tang
Entropy | VOL. 24
29 Jul 2022

Cross Corpus Speech Emotion Recognition using transfer learning and attention-based fusion of Wav2Vec2 and prosody features
Navid Naderi ... Babak Nasersharif
Knowledge-Based Systems | VOL. 277
25 Jul 2023
