Deep Learning for Arabic Speech Recognition Using Convolutional Neural Networks

Soufiyan Ouali

doi:10.52783/jes.3319

Abstract

Extracting a speaker's emotional state from speech has become an active research topic, driven by demand for more human-interactive applications. The field has advanced significantly, especially for English, owing to the availability of large labeled speech corpora. However, comparable methods for Arabic are still in their infancy. In this paper, we present a speech recognition model for Arabic that identifies both the emotional state and the gender of the speaker through voice analysis. Three primary emotion labels were selected: low, standard, and high levels of emotion. Various spectral features, such as mel-frequency cepstral coefficients (MFCC), were extracted and tested to determine the optimal feature set. Furthermore, several machine learning models (SVM, KNN, and HMM) and deep learning models (LSTM and CNN) were evaluated. Results were compared across the five models using different extracted features, leading to the selection of MFCC, root-mean-square (RMS), mel-scaled spectrogram, spectral, and zero-crossing rate as features, and the CNN as the classification model. This configuration achieved an accuracy of 93% for emotion recognition and 99% for gender recognition.
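
As a rough illustration of two of the simpler features named in the abstract (RMS and zero-crossing rate), the following is a minimal pure-Python sketch of frame-level feature extraction. It is not the authors' pipeline: the frame length (400 samples), hop length (160 samples), 16 kHz sample rate, and the synthetic 440 Hz test tone are all assumptions chosen for illustration, and the function names are hypothetical.

```python
import math


def frame_signal(samples, frame_length, hop_length):
    """Split a 1-D list of samples into overlapping fixed-length frames."""
    last_start = len(samples) - frame_length
    return [
        samples[start:start + frame_length]
        for start in range(0, last_start + 1, hop_length)
    ]


def rms(frame):
    """Root-mean-square energy of one frame."""
    return math.sqrt(sum(x * x for x in frame) / len(frame))


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(
        1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
    )
    return crossings / (len(frame) - 1)


def extract_features(samples, frame_length=400, hop_length=160):
    """Return one (rms, zero_crossing_rate) pair per frame.

    The default frame and hop sizes are assumed values (25 ms and 10 ms
    at 16 kHz), not parameters reported in the paper.
    """
    frames = frame_signal(samples, frame_length, hop_length)
    return [(rms(f), zero_crossing_rate(f)) for f in frames]


# Synthetic input (assumption): 0.1 s of a 440 Hz sine at 16 kHz.
sample_rate = 16000
tone = [
    0.5 * math.sin(2 * math.pi * 440 * n / sample_rate)
    for n in range(1600)
]
features = extract_features(tone)
```

In a setup like the one the abstract describes, per-frame values of this kind would be stacked alongside features such as MFCCs and a mel-scaled spectrogram before being passed to a classifier; how the authors combined them is not stated here.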

Journal: Journal of Electrical Systems
Publication Date: Apr 29, 2024
License type: CC BY-ND 4.0
