Recognition of speech emotion using custom 2D-convolution neural network deep learning algorithm

Kudakwashe Zvarevashe,Oludayo O Olugbara

doi:10.3233/ida-194747

Abstract

Speech emotion recognition has become the heart of most human computer interaction applications in the modern world. The growing need to develop emotionally intelligent devices has opened up a lot of research opportunities. Most researchers in this field have applied the use of handcrafted features and machine learning techniques in recognising speech emotion. However, these techniques require extra processing steps and handcrafted features are usually not robust. They are computationally intensive because the curse of dimensionality results in low discriminating power. Research has shown that deep learning algorithms are effective for extracting robust and salient features in dataset. In this study, we have developed a custom 2D-convolution neural network that performs both feature extraction and classification of vocal utterances. The neural network has been evaluated against deep multilayer perceptron neural network and deep radial basis function neural network using the Berlin database of emotional speech, Ryerson audio-visual emotional speech database and Surrey audio-visual expressed emotion corpus. The described deep learning algorithm achieves the highest precision, recall and F1-scores when compared to other existing algorithms. It is observed that there may be need to develop customized solutions for different language settings depending on the area of applications.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Recognition of speech emotion using custom 2D-convolution neural network deep learning algorithm

Abstract

Talk to us

Similar Papers

More From: Intelligent Data Analysis

Lead the way for us

Journal: Intelligent Data Analysis	Publication Date: Sep 30, 2020
Citations: 23

Similar Papers

Continuous tracking of the emotion temperature
Jesús B Alonso ... Agustín Sánchez-Medina
Neurocomputing | VOL. 255
Jesús B Alonso, et. al.Jesús B Alonso ... Agustín Sánchez-Medina
27 Mar 2017
Neurocomputing | VOL. 255

Robust emotion recognition in noisy speech via sparse representation
Xiaoming Zhao ... Shiqing Zhang
Neural Computing and Applications | VOL. 24
Xiaoming Zhao, et. al.Xiaoming Zhao ... Shiqing Zhang
29 Mar 2013
Neural Computing and Applications | VOL. 24

Advances in Speech Emotion Recognition and Analysis: A Review of Applied Machine Learning Methodologies
Ankit Kumar ... Sachi Gupta
International Journal for Research in Applied Science and Engineering Technology | VOL. 12
Ankit Kumar, et. al.Ankit Kumar ... Sachi Gupta
30 Apr 2024
International Journal for Research in Applied Science and Engineering Technology | VOL. 12

Time-Distributed Attention-Layered Convolution Neural Network with Ensemble Learning using Random Forest Classifier for Speech Emotion Recognition
Yalamanchili Bhanusree ... Samayamantula Srinivas Kumar
Journal of Information and Communication Technology | VOL. 22
Yalamanchili Bhanusree, et. al.Yalamanchili Bhanusree ... Samayamantula Srinivas Kumar
01 Jan 2023
Journal of Information and Communication Technology | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Recognition of speech emotion using custom 2D-convolution neural network deep learning algorithm

Abstract

Talk to us

Similar Papers

More From: Intelligent Data Analysis