Abstract

One of the challenges in speech emotion recognition stems from the time-series nature of speech data, whereas the feedforward process in neural networks is unidirectional: the output of one layer is passed directly to the next. Such a feedforward process cannot retain past information. Thus, when a Deep Neural Network (DNN) is used for speech emotion recognition, problems arise, such as handling the speaker's speech rate; the DNN cannot analyze the underlying acoustic patterns and therefore cannot map different levels of speech rate. The Recurrent Neural Network (RNN), in contrast, can process sequential input while retaining relevant information from previous time steps. This paper presents the characteristics of RNN methods, namely the LSTM and GRU techniques, for speech emotion recognition using the Berlin EMODB dataset. The dataset is divided into 80% for training and 20% for testing. The feature extraction methods used are Zero Crossing Rate (ZCR), Mel Frequency Cepstral Coefficients (MFCC), Root Mean Square Energy (RMSE), Mel Spectrogram, and Chroma. This study compares the CNN, LSTM, and GRU algorithms. The classification results show that the CNN algorithm achieves the best result, with an accuracy of 79.13%, whereas LSTM and GRU reach only 55.76% and 55.14%, respectively.
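As an illustration of the pipeline described in the abstract, the sketch below extracts the five named features with librosa and builds a small GRU/LSTM classifier in Keras. The helper names (`extract_features`, `build_rnn`), the sampling rate, and the layer sizes are assumptions for illustration only, not the authors' exact configuration.

```python
import numpy as np
import librosa
from tensorflow.keras import layers, models

def extract_features(path, sr=22050, n_mfcc=40):
    """Compute the five features named in the abstract for one utterance
    and stack their per-frame means into a single vector."""
    y, sr = librosa.load(path, sr=sr)
    zcr    = np.mean(librosa.feature.zero_crossing_rate(y))
    rmse   = np.mean(librosa.feature.rms(y=y))
    mfcc   = np.mean(librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc), axis=1)
    mel    = np.mean(librosa.feature.melspectrogram(y=y, sr=sr), axis=1)
    chroma = np.mean(librosa.feature.chroma_stft(y=y, sr=sr), axis=1)
    return np.hstack([zcr, rmse, mfcc, mel, chroma])

def build_rnn(num_features, num_classes=7, cell="gru"):
    """Small recurrent classifier; Berlin EMODB covers seven emotion classes.
    Layer widths here are illustrative assumptions."""
    Rec = layers.GRU if cell == "gru" else layers.LSTM
    model = models.Sequential([
        layers.Input(shape=(num_features, 1)),  # feature vector treated as a 1-channel sequence
        Rec(128),
        layers.Dense(64, activation="relu"),
        layers.Dense(num_classes, activation="softmax"),
    ])
    model.compile(optimizer="adam",
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    return model

# Typical usage with an 80/20 split, as in the paper
# (requires: from sklearn.model_selection import train_test_split):
# X = np.array([extract_features(p) for p in wav_paths])[..., np.newaxis]
# X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2)
# build_rnn(X.shape[1]).fit(X_train, y_train, validation_data=(X_test, y_test))
```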
