Abstract

Emotion recognition from speech is crucial for advancing human-computer interaction, enabling more natural and empathetic communication. This study proposes a novel Speech Emotion Recognition (SER) framework that integrates Convolutional Neural Networks (CNNs) and transformer-based architectures to capture both local and contextual speech features. The model demonstrates strong classification performance, particularly for prominent emotions such as anger, sadness, and happiness, while detection of less frequent emotions, such as surprise and calm, remains challenging. Limitations of current datasets, including limited linguistic diversity, are also discussed. The findings underscore the model's robustness and identify avenues for enhancement, including more diverse training data and transfer learning. Future work will explore multimodal approaches and real-time implementation on edge devices to improve the system's adaptability in real-world scenarios.
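
The sketch below illustrates the kind of CNN + transformer pipeline the abstract describes: a convolutional front-end extracts local time-frequency patterns from a log-mel spectrogram, and a transformer encoder models longer-range context before classification. All layer sizes, the 8-class output, and the input format are illustrative assumptions, not details taken from the paper.

    # Minimal sketch of a CNN + transformer SER model (assumed hyperparameters).
    import torch
    import torch.nn as nn

    class SERModel(nn.Module):
        def __init__(self, n_mels=64, d_model=128, n_heads=4, n_layers=2, n_classes=8):
            super().__init__()
            # CNN front-end: captures local time-frequency patterns
            self.cnn = nn.Sequential(
                nn.Conv2d(1, 32, kernel_size=3, padding=1), nn.ReLU(),
                nn.MaxPool2d(2),
                nn.Conv2d(32, 64, kernel_size=3, padding=1), nn.ReLU(),
                nn.MaxPool2d(2),
            )
            self.proj = nn.Linear(64 * (n_mels // 4), d_model)
            # Transformer encoder: models contextual dependencies across frames
            # (positional encoding omitted here for brevity)
            layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
            self.encoder = nn.TransformerEncoder(layer, n_layers)
            self.head = nn.Linear(d_model, n_classes)

        def forward(self, spec):                    # spec: (batch, 1, n_mels, time)
            x = self.cnn(spec)                      # (batch, 64, n_mels/4, time/4)
            x = x.permute(0, 3, 1, 2).flatten(2)    # (batch, time/4, 64 * n_mels/4)
            x = self.encoder(self.proj(x))          # contextual frame embeddings
            return self.head(x.mean(dim=1))         # pool over time -> class logits

    # Usage on a dummy log-mel spectrogram batch (2 clips, 64 mel bins, 300 frames)
    logits = SERModel()(torch.randn(2, 1, 64, 300))
    print(logits.shape)  # torch.Size([2, 8])

Mean pooling over time is one simple way to aggregate frame-level embeddings into a single utterance-level representation; attention pooling is a common alternative.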
