Speech emotion recognition with light gradient boosting decision trees machine

Kah Liang Ong, Kian Ming Lim, Chin Poo Lee, Heng Siong Lim

doi:10.11591/ijece.v13i4.pp4020-4028

Abstract

Speech emotion recognition aims to identify the emotion expressed in speech by analyzing the audio signal. In this work, data augmentation is first performed on the audio samples to increase the number of samples for better model learning. The audio samples are then encoded as frequency-domain and temporal-domain features. For classification, a light gradient boosting machine is used, and its hyperparameters are tuned to determine the optimal settings. Because speech emotion recognition datasets are imbalanced, the class weights are set inversely proportional to the sample distribution, so that minority classes are assigned higher weights. The experimental results demonstrate that the proposed method outperforms the state-of-the-art methods, with 84.91% accuracy on the Emo-DB dataset, 67.72% on the Ryerson audio-visual database of emotional speech and song (RAVDESS) dataset, and 62.94% on the interactive emotional dyadic motion capture (IEMOCAP) dataset.
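The class-weighting step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formula, so the code below assumes the common "balanced" heuristic, n_samples / (n_classes * class_count); the function name `inverse_frequency_weights` and the toy label list are hypothetical and are not taken from the paper.

```python
from collections import Counter


def inverse_frequency_weights(labels):
    """Return {class: weight}, with weight inversely proportional to class count.

    Assumption: the "balanced" heuristic n_samples / (n_classes * count).
    The paper's exact weighting formula is not stated in the abstract.
    """
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * cnt) for cls, cnt in counts.items()}


# Hypothetical toy label set in which "neutral" is the minority class.
labels = ["angry"] * 6 + ["happy"] * 3 + ["neutral"] * 1
weights = inverse_frequency_weights(labels)

# Minority classes receive the larger weights.
for cls, w in sorted(weights.items(), key=lambda kv: kv[1]):
    print(f"{cls}: {w:.3f}")
```

A dictionary of this shape is the kind of value that could be passed as the `class_weight` argument of LightGBM's scikit-learn-style `LGBMClassifier`, although whether the authors configured their model this way is not stated here.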


Journal: International Journal of Electrical and Computer Engineering (IJECE)
Publication Date: Aug 1, 2023
Citations: 3
License type: CC BY-SA 4.0

Similar Papers

Speech emotion recognition with deep convolutional neural networks
Dias Issa ... Adnan Yazici
Biomedical Signal Processing and Control | VOL. 59
27 Feb 2020

A CNN-Assisted Enhanced Audio Signal Processing for Speech Emotion Recognition.
Mustaqeem ... Soonil Kwon
Sensors | VOL. 20
28 Dec 2019

SER: Performance Evaluation of CNN Model Along with an Overview of Available Indic Speech Datasets, and Transition of Classifiers From Traditional to Modern Era
Surbhi Khurana ... Poonam Bansal
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. -
26 Jun 2023

Enhancing speech emotion recognition with deep learning using multi-feature stacking and data augmentation
Khasyi Al Mukarram ... Amalia Zahra
Bulletin of Electrical Engineering and Informatics | VOL. 13
01 Jun 2024
