Enhanced speech emotion detection using deep neural networks

S Lalitha, Shikha Tripathi, Deepa Gupta

doi:10.1007/s10772-018-09572-8

Abstract

This paper investigates how effectively perceptual speech features perform in emotion detection. The perceptual features considered are Mel frequency cepstral coefficients (MFCC), perceptual linear predictive cepstrum (PLPC), Mel frequency perceptual linear prediction cepstrum (MFPLPC), bark frequency cepstral coefficients (BFCC), revised perceptual linear prediction coefficients (RPLP) and inverted Mel frequency cepstral coefficients (IMFCC). The algorithm using these auditory cues is evaluated with deep neural networks (DNN). The novelty of the work lies in analysing the perceptual features to identify the predominant ones that carry significant emotional information about the speaker. The algorithm is validated on the publicly available Berlin database, with seven emotions represented both in a 1-dimensional categorical space and in a 2-dimensional continuous space spanning the valence and arousal dimensions. Comparative analysis reveals that a DNN with the identified combination of perceptual features yields a considerable improvement in emotion recognition performance.
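To make the difference between two of the listed features concrete, the sketch below builds a triangular Mel filterbank (the front end of MFCC) and an inverted Mel filterbank (the front end of IMFCC) by flipping the first one about the midpoint of the frequency axis. This is an illustration only, not the authors' implementation: the abstract gives no analysis parameters, so the 26 filters, 512-point FFT and 16 kHz sampling rate used here are assumptions, as are the function names and the 2595·log10(1 + f/700) form of the Mel scale.

```python
import math

def hz_to_mel(f_hz):
    """Hz -> Mel, using the common 2595*log10(1 + f/700) form (assumed here)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)

def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters spaced uniformly on the Mel scale.

    Returns n_filters rows, each holding n_fft // 2 + 1 weights
    (one per non-negative FFT bin).
    """
    n_bins = n_fft // 2 + 1
    mel_max = hz_to_mel(sample_rate / 2.0)
    # n_filters + 2 edge frequencies, equally spaced in Mel, mapped to FFT bins
    edges_hz = [mel_to_hz(mel_max * i / (n_filters + 1))
                for i in range(n_filters + 2)]
    bins = [int(math.floor((n_fft + 1) * f / sample_rate)) for f in edges_hz]

    bank = [[0.0] * n_bins for _ in range(n_filters)]
    for m in range(1, n_filters + 1):
        left, centre, right = bins[m - 1], bins[m], bins[m + 1]
        for k in range(left, centre):        # rising slope
            bank[m - 1][k] = (k - left) / (centre - left)
        for k in range(centre, right):       # peak and falling slope
            bank[m - 1][k] = (right - k) / (right - centre)
    return bank

def inverted_mel_filterbank(n_filters, n_fft, sample_rate):
    """Mel filterbank flipped in both filter order and frequency axis,
    so the narrow, densely packed filters sit at high frequencies."""
    bank = mel_filterbank(n_filters, n_fft, sample_rate)
    return [row[::-1] for row in reversed(bank)]

def peak_bins(bank):
    """FFT bin index at which each filter reaches its maximum weight."""
    return [row.index(max(row)) for row in bank]

# Assumed analysis parameters, not taken from the paper.
mel = mel_filterbank(26, 512, 16000)
inv = inverted_mel_filterbank(26, 512, 16000)
print("Mel peaks (first 3, last 3):", peak_bins(mel)[:3], peak_bins(mel)[-3:])
print("Inverted peaks (first 3, last 3):", peak_bins(inv)[:3], peak_bins(inv)[-3:])
```

The Mel filterbank resolves low frequencies finely and high frequencies coarsely, while the inverted one does the opposite, which is why the two feature sets can carry complementary information. Cepstral coefficients would follow by applying each filterbank to a frame's power spectrum, taking the logarithm and then a discrete cosine transform.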

Journal: International Journal of Speech Technology
Publication Date: Nov 22, 2018
Citations: 60
