Speech Emotion Recognition under White Noise

Chengwei Huang, Guoming Chen, Yongqiang Bao, Hua Yu, Li Zhao

doi:10.2478/aoa-2013-0054

Abstract

Speakers' emotional states are recognized from speech signals corrupted by additive white Gaussian noise (AWGN). The influence of white noise on a typical emotion recognition system is studied. The emotion classifier is implemented with a Gaussian mixture model (GMM). A Chinese speech emotion database is used for training and testing; it includes nine emotion classes (happiness, sadness, anger, surprise, fear, anxiety, hesitation, confidence, and the neutral state). Two speech enhancement algorithms are introduced to improve emotion classification. In the experiments, the GMM is trained on clean speech data and tested under AWGN at various signal-to-noise ratios (SNRs). Both the emotion class model and the dimension space model are adopted to evaluate the emotion recognition system. Under the emotion class model, the nine emotion classes are classified. Under the dimension space model, the arousal and valence dimensions are each classified into positive or negative regions. The experimental results show that the speech enhancement algorithms consistently improve the performance of the emotion recognition system under various SNRs, and that positive emotions are more likely to be misclassified as negative emotions in a white-noise environment.
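The test condition described above, clean speech corrupted by AWGN at a chosen SNR, can be illustrated with a minimal sketch. This is not the authors' code: the function names `add_awgn` and `measured_snr_db`, the 16 kHz sampling rate, the SNR values, and the sine wave standing in for a speech utterance are all assumptions made for illustration.

```python
import numpy as np


def add_awgn(signal, snr_db, rng=None):
    """Return `signal` plus white Gaussian noise scaled to a target SNR (dB).

    Hypothetical helper for illustration; not taken from the paper.
    """
    rng = np.random.default_rng() if rng is None else rng
    signal = np.asarray(signal, dtype=float)
    # Average signal power, then the noise power implied by the target SNR:
    # SNR_dB = 10 * log10(P_signal / P_noise)
    signal_power = np.mean(signal ** 2)
    noise_power = signal_power / (10.0 ** (snr_db / 10.0))
    noise = rng.normal(0.0, np.sqrt(noise_power), size=signal.shape)
    return signal + noise


def measured_snr_db(clean, noisy):
    """Empirical SNR in dB between a clean signal and its noisy version."""
    noise = np.asarray(noisy, dtype=float) - np.asarray(clean, dtype=float)
    return 10.0 * np.log10(np.mean(np.square(clean)) / np.mean(noise ** 2))


# Stand-in for one second of a clean utterance (assumed 16 kHz sampling).
rng = np.random.default_rng(0)
t = np.arange(16000) / 16000.0
clean = np.sin(2.0 * np.pi * 220.0 * t)

# Corrupt the same clean signal at several assumed SNR levels.
for snr_db in (20, 10, 0):
    noisy = add_awgn(clean, snr_db, rng)
    print(snr_db, round(measured_snr_db(clean, noisy), 2))
```

In a setup like the one described, a classifier trained on the clean signals would then be evaluated on features extracted from each noisy version, optionally after speech enhancement.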

Journal: Archives of Acoustics	Publication Date: Dec 1, 2013
Citations: 35
