Exploitation of Phase-Based Features for Whispered Speech Emotion Recognition

Jun Deng, Xinzhou Xu, Bjorn Schuller, Zixing Zhang, Sascha Fruhholz

doi:10.1109/access.2016.2591442

Abstract

Features for speech emotion recognition are usually dominated by spectral magnitude information, while the phase spectrum is ignored because it is difficult to interpret properly. Motivated by recent successes of phase-based features in speech processing, this paper investigates the effectiveness of phase information for whispered speech emotion recognition. We select two types of phase-based features (i.e., modified group delay features and all-pole group delay features), both of which have shown wide applicability in speech analysis and are studied here for whispered speech emotion recognition. To exploit these features, we propose a new speech emotion recognition framework that employs the outer product in combination with power and L2 normalization. This technique encodes a sequence of phase-based features of any length into a fixed-dimension vector. The resulting representation is used to train a linear kernel classifier. Experimental results on the Geneva Whispered Emotion Corpus database, which includes normal and whispered phonation, demonstrate the effectiveness of the proposed method compared with other modern systems. It is also shown that combining phase information with magnitude information can significantly improve performance over common systems that adopt magnitude information alone.
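
The encoding step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formulation, so the details here are assumptions: the function name `encode_sequence` is hypothetical, the per-frame outer products are averaged over time, the power normalization is the signed form with exponent `alpha = 0.5`, and the input is any (T, D) array of frame-level features (for example, group delay features computed elsewhere).

```python
import numpy as np


def encode_sequence(frames, alpha=0.5, eps=1e-12):
    """Encode a (T, D) feature sequence as a fixed-length vector of size D*D.

    Illustrative only: averaging the outer products and alpha=0.5 are
    assumptions, not settings taken from the paper.
    """
    frames = np.asarray(frames, dtype=float)
    num_frames = frames.shape[0]

    # Average of per-frame outer products x_t x_t^T; the result is (D, D)
    # no matter how many frames the sequence contains.
    pooled = frames.T @ frames / num_frames

    # Flatten to a single vector of length D*D.
    vec = pooled.ravel()

    # Signed power normalization: sign(v) * |v|^alpha.
    vec = np.sign(vec) * np.abs(vec) ** alpha

    # L2 normalization; eps guards against division by zero.
    return vec / (np.linalg.norm(vec) + eps)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    short_utt = rng.normal(size=(50, 4))    # 50 frames, 4 features each
    long_utt = rng.normal(size=(200, 4))    # 200 frames, 4 features each
    print(encode_sequence(short_utt).shape, encode_sequence(long_utt).shape)
```

Both utterances map to vectors of the same length, so they could then be passed to a linear classifier (for instance scikit-learn's `LinearSVC`, which is an assumption here rather than the classifier named in the paper).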

Journal: IEEE Access	Publication Date: Jan 1, 2016
Citations: 88	License type: cc-by-nc-nd

Similar Papers

Learning affective representations based on magnitude and dynamic relative phase information for speech emotion recognition
Lili Guo ... Seiichi Nakagawa
Speech Communication | VOL. 136
20 Dec 2021

Modulation spectral features for speech emotion recognition using deep neural networks
Premjeet Singh ... Goutam Saha
Speech Communication | VOL. 146
19 Nov 2022

Fisher Kernels on Phase-Based Features for Speech Emotion Recognition
Jun Deng ... Didier Grandjean
25 Dec 2016

Music genre classification by fusion of Modified Group Delay and Melodic Features
Rajeev Rajan ... Hema A Murthy
01 Mar 2017
