Abstract

Continuous, time-varying prediction of emotion from speech in terms of attributes (e.g., arousal) has received considerable attention in the past few years. However, the variability introduced by factors unrelated to emotion, such as speaker and phonetic variability, can lead to less reliable models and less accurate emotion predictions, and it has not yet been fully explored. In particular, even though speaker variability has been shown to be a significant confounding factor in continuous emotion prediction systems, there remains a paucity of analyses of how speaker variability affects such systems and which methods can compensate for it. This paper first formulates speaker variability systematically in terms of probability distributions in both the feature and model spaces, and quantifies its effect by comparing inter- and intra-speaker variability across speaker-dependent models. Second, two compensation techniques are proposed, based on partial least squares dimensionality reduction and on feature mapping. Finally, the effectiveness of the proposed techniques is validated on three databases, across which they show consistent improvement in arousal, valence and dominance prediction. Additional quantitative analysis reveals that the two proposed techniques compensate for speaker variability in the feature and model spaces simultaneously.
