Abstract
Recognizing emotions from speech is a formidable challenge because of the complexity of emotional expression. The performance of Speech Emotion Recognition (SER) depends heavily on the emotional cues extracted from speech, yet most emotional features are also sensitive to emotionally neutral factors such as the speaker's identity, speaking style, and gender. In this work, we postulate that computing deltas for individual features preserves information that is mainly relevant to emotion while minimizing the influence of emotionally irrelevant components, thus leading to fewer misclassifications. In addition, speech commonly contains silent and emotionally irrelevant frames; the proposed technique is effective at focusing on feature representations that are relevant to emotion. We therefore propose an attention-based two-dimensional convolutional recurrent neural network to learn discriminative characteristics and predict emotions, using the Mel-spectrogram for feature extraction. The proposed technique is evaluated on the IEMOCAP dataset and achieves improved performance, with 68% accuracy.
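The delta computation the abstract refers to can be sketched as follows. This is a minimal pure-Python illustration of the standard first-order regression formula for delta coefficients, not the authors' implementation; the window half-width, edge padding, and function name are assumptions made for the example:

```python
def compute_deltas(frames, n=2):
    """Delta (first-order regression) coefficients over a sequence of
    feature vectors, as commonly used to augment spectral features.

    frames: list of equal-length feature vectors, one per time frame.
    n: regression window half-width (n=2 is a common default).
    """
    # Normalizer of the standard delta regression formula:
    # delta_t = sum_{i=1..n} i * (c_{t+i} - c_{t-i}) / (2 * sum_{i=1..n} i^2)
    denom = 2 * sum(i * i for i in range(1, n + 1))
    num_frames = len(frames)
    dim = len(frames[0])
    deltas = []
    for t in range(num_frames):
        vec = []
        for d in range(dim):
            acc = 0.0
            for i in range(1, n + 1):
                # Replicate edge frames so the window stays in bounds.
                later = frames[min(t + i, num_frames - 1)][d]
                earlier = frames[max(t - i, 0)][d]
                acc += i * (later - earlier)
            vec.append(acc / denom)
        deltas.append(vec)
    return deltas
```

For a linearly increasing feature track the interior delta values come out close to the slope, while frames near the edges are damped by the replicated padding; in practice these delta tracks would be stacked with the Mel-spectrogram features before being fed to the network.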
Published in: International Journal of Informatics, Information System and Computer Engineering (INJIISCOM)