Speech Emotion Recognition Using Deep Neural Networks, Transfer Learning, and Ensemble Classification Techniques

Serban Mihalache,Dragos Burileanu

doi:10.59277/romjist.2023.3-4.10

Abstract

Speech emotion recognition (SER) is the task of determining the affective content present in speech, a promising research area of great interest in recent years, with important applications especially in the field of forensic speech and law enforcement operations, among others. In this paper, systems based on deep neural networks (DNNs) spanning five levels of complexity are proposed, developed, and tested, including systems leveraging transfer learning (TL) for the top modern image recognition deep learning models, as well as several ensemble classification techniques that lead to significant performance increases. The systems were tested on the most relevant SER datasets: EMODB, CREMAD, and IEMOCAP, in the context of: (i) classification: using the standard full sets of emotion classes, as well as additional negative emotion subsets relevant for forensic speech applications; and (ii) regression: using the continuously valued 2D arousal-valence affect space. The proposed systems achieved state-of-the-art results for the full class subset for EMODB (up to 83% accuracy) and performance comparable to other published research for the full class subsets for CREMAD and IEMOCAP (up to 55% and 62% accuracy). For the class subsets focusing only on negative affective content, the proposed solutions offered top performance vs. previously published state of the art results.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Speech Emotion Recognition Using Deep Neural Networks, Transfer Learning, and Ensemble Classification Techniques

Abstract

Talk to us

Similar Papers

More From: Romanian Journal of Information Science and Technology

Lead the way for us

Journal: Romanian Journal of Information Science and Technology	Publication Date: Sep 28, 2023
Citations: 2

Similar Papers

Patient's Pain Recognition by Using Deep Models Based on Transfer Learning
Elaf Noori Saddam ... Saad Mutashar Abbas
-
Elaf Noori Saddam, et. al.Elaf Noori Saddam ... Saad Mutashar Abbas
01 Nov 2022
01 Nov 2022

Deep Convolutional Neural Networks for Feature Extraction in Speech Emotion Recognition
Panikos Heracleous ... Yasser Mohammad
-
Panikos Heracleous, et. al.Panikos Heracleous ... Yasser Mohammad
01 Jan 2019
01 Jan 2019

Pre-trained Deep Convolution Neural Network Model With Attention for Speech Emotion Recognition.
Hua Zhang ... Fangyao Shen
Frontiers in Physiology | VOL. 12
Hua Zhang, et. al.Hua Zhang ... Fangyao Shen
02 Mar 2021
Frontiers in Physiology | VOL. 12

Transfer learning for time series classification
Hassan Ismail Fawaz ... Germain Forestier
-
Hassan Ismail Fawaz, et. al.Hassan Ismail Fawaz ... Germain Forestier
05 Nov 2018
05 Nov 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Speech Emotion Recognition Using Deep Neural Networks, Transfer Learning, and Ensemble Classification Techniques

Abstract

Talk to us

Similar Papers

More From: Romanian Journal of Information Science and Technology