Abstract

This paper proposes a speech emotion recognition technique based on an optimized deep neural network. The speech signals are denoised using a novel adaptive wavelet transform combined with a modified galactic swarm optimization algorithm (AWT-MGSO). From the denoised speech signals, spectral features such as LPC (linear prediction coefficients), MFCC (Mel-frequency cepstral coefficients), and PSD (power spectral density), along with prosodic features such as energy, entropy, formant frequencies, and pitch, are extracted, and a subset of these features is selected by the adaptive sunflower optimization algorithm (ASFO). An optimized DNN-DHO (deep neural network with the deer hunting optimization algorithm) is proposed for emotion classification, and an enhanced squirrel search algorithm is proposed to update the weights of the DNN-DHO classifier. In this study, all eight speech emotions from the RAVDESS (Ryerson Audio-Visual Database of Emotional Speech and Song) and TESS (Toronto Emotional Speech Set) databases for English and the IITKGP-SEHSC (Indian Institute of Technology Kharagpur Simulated Emotion Hindi Speech Corpus) database for Hindi are classified. The experimental results are compared with those of the DNN-DHO, DNN (deep neural network), and DAE (deep auto-encoder) classifiers. The results show that the proposed DNN-DHO classifier achieves maximum accuracies of 97.85% on the TESS dataset, 97.14% on the RAVDESS dataset, and 93.75% on the IITKGP-SEHSC dataset.
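To make the feature-extraction step concrete, the following is a minimal sketch of computing the listed spectral and prosodic features with librosa and SciPy. The frame settings, model orders, and the extract_features() helper are illustrative assumptions, not the authors' exact configuration; the AWT-MGSO denoising, ASFO selection, and DNN-DHO classification stages are not shown.

```python
# A minimal feature-extraction sketch, assuming librosa and SciPy are available.
# Feature names follow the abstract; all parameter values are illustrative.
import numpy as np
import librosa
from scipy.signal import welch


def extract_features(path, sr=16000, lpc_order=12, n_mfcc=13):
    """Return one feature vector of spectral and prosodic descriptors."""
    y, sr = librosa.load(path, sr=sr)

    # Spectral features: LPC, MFCC, and Welch power spectral density.
    lpc = librosa.lpc(y, order=lpc_order)                    # (lpc_order + 1,)
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc)   # (n_mfcc, frames)
    _, psd = welch(y, fs=sr, nperseg=512)                    # (257,)

    # Prosodic features: short-time energy, spectral entropy, pitch (YIN).
    energy = librosa.feature.rms(y=y)                        # (1, frames)
    p = psd / (psd.sum() + 1e-12)
    entropy = -np.sum(p * np.log2(p + 1e-12))                # scalar
    f0 = librosa.yin(y, fmin=65, fmax=400, sr=sr)            # (frames,)

    # Rough formant estimates from the angles of the LPC polynomial roots.
    roots = [r for r in np.roots(lpc) if np.imag(r) > 0]
    formants = np.sort(np.angle(roots) * sr / (2 * np.pi))[:3]

    # Frame-level features are summarised by their means before concatenation.
    return np.concatenate([
        lpc, mfcc.mean(axis=1), psd,
        [energy.mean(), entropy, f0.mean()], formants,
    ])
```

In this sketch the frame-level features are reduced to simple means so that each utterance yields a single fixed-length vector, which can then be passed to a feature-selection stage and a classifier.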
