Abstract

In the super-smart society (Society 5.0), new and fast methods are needed in speech recognition, emotion recognition, and speech emotion recognition to maximize human-machine (human-computer) interaction and collaboration. The speech signal carries a great deal of information about the speaker, such as age, sex, ethnicity, health condition, emotions, and thoughts. The field of study that analyzes a person's mood from speech is called speech emotion recognition (SER). Classifying emotions from speech data is a complicated problem for artificial intelligence and its sub-discipline, machine learning, because the speech signal contains many frequencies and varying characteristics and is therefore hard to analyze. Speech data are digitized with signal processing methods, and speech features are extracted from the digitized signal. These features vary depending on emotions such as sadness, fear, anger, happiness, boredom, and confusion. Although different methods have been developed for determining audio properties and recognizing emotions, success rates vary with the language, culture, emotion set, and dataset. Speech emotion recognition therefore needs new methods that can be applied to datasets of different sizes, that increase classification success, that extract the most discriminative features, and that are affordable. Success rates are affected by many factors, such as the methods used, the scarcity of speech emotion datasets, the homogeneity of the database, the difficulty of the language (linguistic differences), noise in the audio data, and the length of the recordings. Within the scope of this study, work on emotion recognition from speech signals from past to present is analyzed in detail. Classification studies based on a discrete emotion model are examined, using speech data from the Berlin emotional database (EMO-DB), the Italian emotional speech database (EMOVO), the Surrey Audio-Visual Expressed Emotion database (SAVEE), and the Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS), which are mostly speaker- and content-independent. The results of both classical classifiers and deep learning methods are compared. Deep learning methods achieve higher accuracy, but classical classification remains important for determining the defining features of speech, song, or voice, and thereby improves the feature extraction stage. This study can contribute to the literature and help researchers in the SER field.
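To make the pipeline the abstract summarizes concrete (digitize the signal, extract features, classify a discrete emotion), the following is a minimal sketch, not the implementation used in the study. It assumes Python with librosa and scikit-learn, MFCCs as the speech features, and a support vector machine as the classical classifier; the file paths and labels in corpus are hypothetical stand-ins for a real dataset such as EMO-DB.

    # Minimal SER sketch: MFCC features + a classical classifier (SVM).
    # Assumes librosa and scikit-learn; paths and labels are hypothetical.
    import numpy as np
    import librosa
    from sklearn.model_selection import train_test_split
    from sklearn.svm import SVC
    from sklearn.metrics import accuracy_score

    def extract_features(path, n_mfcc=13):
        """Load one utterance and summarize it as a fixed-length MFCC vector."""
        y, sr = librosa.load(path, sr=16000)  # digitize/resample the signal
        mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc)
        # Mean and std over time yield one fixed-length vector per utterance.
        return np.concatenate([mfcc.mean(axis=1), mfcc.std(axis=1)])

    # Hypothetical (wav path, discrete emotion label) pairs, e.g. from EMO-DB.
    corpus = [("wav/03a01Fa.wav", "happiness"),
              ("wav/03a01Wa.wav", "anger")]  # ... more utterances

    X = np.array([extract_features(path) for path, _ in corpus])
    y = np.array([label for _, label in corpus])

    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.2, random_state=0)
    clf = SVC(kernel="rbf").fit(X_train, y_train)
    print("accuracy:", accuracy_score(y_test, clf.predict(X_test)))

A deep learning variant of the same pipeline would replace the SVM with, for example, a convolutional network applied to the full MFCC time series rather than its summary statistics.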
