Emotional Speaker Recognition based on Machine and Deep Learning

Tshephisho Joseph Sefara, Tumisho Billson Mokgonyane

doi:10.1109/imitec50163.2020.9334138

Abstract

Speaker recognition is the task of identifying a speaker from the characteristics of their voice. Speaker recognition technologies are widely used in many domains. Most speaker recognition systems are trained on clean, neutral recordings; however, their performance tends to degrade when recognising emotional speech. This paper presents an emotional speaker recognition system trained with machine learning and deep learning algorithms on time, frequency and spectral features extracted from emotional speech in the Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS). We trained and compared five machine learning models (Logistic Regression, Support Vector Machine, Random Forest, XGBoost, and k-Nearest Neighbor) and three deep learning models (Long Short-Term Memory network, Multilayer Perceptron, and Convolutional Neural Network). In the evaluation, the deep neural networks outperformed the machine learning models, attaining the highest accuracy of 92% and outperforming state-of-the-art models in emotional speaker detection from speech signals.
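
The general idea of recognising a speaker from per-utterance features can be illustrated with a minimal sketch. This is not the authors' pipeline: it does not use RAVDESS, the "utterances" are synthetic tones, the two features (zero-crossing rate and RMS energy) are only simple stand-ins for the time-domain features the abstract mentions, and every name below (`synth_utterance`, `features`, `knn_predict`, `run_demo`, the speaker labels and pitches) is hypothetical. It uses a small k-Nearest Neighbor classifier written with the Python standard library only.

```python
import math
import random
from collections import Counter

SAMPLE_RATE = 16000  # Hz; assumed for illustration only


def synth_utterance(pitch_hz, rng, duration_s=0.1):
    """Toy 'utterance': a sine tone at the given pitch plus light noise."""
    n = int(SAMPLE_RATE * duration_s)
    return [
        math.sin(2 * math.pi * pitch_hz * t / SAMPLE_RATE) + rng.gauss(0, 0.005)
        for t in range(n)
    ]


def zero_crossing_rate(signal):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(signal, signal[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(signal) - 1)


def rms_energy(signal):
    """Root-mean-square amplitude of the signal."""
    return math.sqrt(sum(x * x for x in signal) / len(signal))


def features(signal):
    """Two-dimensional feature vector for one utterance."""
    return (zero_crossing_rate(signal), rms_energy(signal))


def knn_predict(train, query, k=3):
    """Majority vote among the k nearest training vectors (Euclidean distance)."""
    nearest = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


def run_demo(seed=0):
    """Train on 7 and test on 3 synthetic utterances per hypothetical speaker."""
    rng = random.Random(seed)
    speakers = {"speaker_a": 120.0, "speaker_b": 220.0}  # made-up pitches in Hz
    train, test = [], []
    for label, pitch in speakers.items():
        for i in range(10):
            jittered = pitch * rng.uniform(0.95, 1.05)
            sample = (features(synth_utterance(jittered, rng)), label)
            (train if i < 7 else test).append(sample)
    correct = sum(knn_predict(train, x) == y for x, y in test)
    return correct / len(test)


if __name__ == "__main__":
    print(f"toy accuracy: {run_demo():.2f}")
```

A real system along the lines described in the abstract would replace the synthetic tones with recorded emotional speech, use a much richer feature set, and compare several classifiers rather than a single k-NN.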
