Abstract

In sophisticated Human-Computer Interfaces (HCI), the user's emotional state is becoming a crucial component, and it is closely linked to emotional speech recognition. Spoken expressions, which form part of human-machine interaction, are an important source of emotional information. Speech emotion recognition (SER) with deep learning (DL) remains a highly active research topic, especially in affective computing, owing to its expanding potential, advances in algorithms, and practical applications. The paralinguistic information contained in human speech can be modeled with quantitative factors such as pitch, intensity, accent, and Mel-Frequency Cepstral Coefficients (MFCC). SER is usually achieved through three key procedures: data processing, feature selection/extraction, and classification based on the underlying emotional qualities. The nature of these procedures and the peculiarities of human speech support the use of DL techniques for SER implementation. A variety of DL methods have been applied to SER tasks in recent affective computing research; however, only a small number of works capture the underlying ideas and methodologies that can facilitate the three main steps of SER implementation. With a focus on these three processes, this work provides a state-of-the-art review of research from the last ten years that has tackled SER tasks from a DL perspective. Several issues are covered in detail, including the low classification accuracy of speaker-independent experiments and the related remedies. The review also offers principles for SER evaluation, emphasizing metrics that can be experimented with and common baselines.
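To make the three SER steps named above concrete, the following is a minimal, illustrative sketch of a conventional (non-DL) pipeline: loading and resampling audio (data processing), computing utterance-level MFCC statistics (feature extraction), and training a simple classifier on emotion labels (classification). The file paths and labels are hypothetical placeholders, and the use of librosa and scikit-learn is an assumption for illustration, not a method surveyed in the paper.

```python
# Illustrative SER pipeline sketch: data processing -> feature extraction -> classification.
# Paths and labels below are placeholders for an emotional speech corpus (assumption).
import numpy as np
import librosa
from sklearn.svm import SVC


def extract_features(path, sr=16000, n_mfcc=13):
    """Load one utterance and return utterance-level MFCC statistics."""
    y, sr = librosa.load(path, sr=sr)                         # data processing: load and resample
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc)    # feature extraction: MFCC frames
    # Summarize frame-level features into a fixed-length utterance vector.
    return np.concatenate([mfcc.mean(axis=1), mfcc.std(axis=1)])


# Hypothetical corpus: replace with real utterance paths and emotion labels.
files = ["utt_001.wav", "utt_002.wav"]
labels = ["happy", "angry"]

X = np.stack([extract_features(f) for f in files])

# Classification based on the underlying emotional qualities;
# an SVM stands in here for the DL classifiers discussed in the survey.
clf = SVC(kernel="rbf").fit(X, labels)
print(clf.predict(X))
```

In DL-based SER, the hand-crafted feature and SVM stages above are typically replaced by learned representations (e.g., CNN or recurrent layers over spectrogram or MFCC inputs), but the three-step structure of the pipeline remains the same.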
