Speaker-based language identification for Ethio-Semitic languages using CRNN and hybrid features

Malefia Demilie Melese,Amlakie Aschale Alemu,Ayodeji Olalekan Salau,Ibrahim Gashaw Kasa

doi:10.1080/0954898x.2024.2359610

Abstract

ABSTRACT Natural language is frequently employed for information exchange between humans and computers in modern digital environments. Natural Language Processing (NLP) is a basic requirement for technological advancement in the field of speech recognition. For additional NLP activities like speech-to-text translation, speech-to-speech translation, speaker recognition, and speech information retrieval, language identification (LID) is a prerequisite. In this paper, we developed a Language Identification (LID) model for Ethio-Semitic languages. We used a hybrid approach (a convolutional recurrent neural network (CRNN)), in addition to a mixed (Mel frequency cepstral coefficient (MFCC) and mel-spectrogram) approach, to build our LID model. The study focused on four Ethio-Semitic languages: Amharic, Ge’ez, Guragigna, and Tigrinya. By using data augmentation for the selected languages, we were able to expand our original dataset of 8 h of audio data to 24 h and 40 min. The proposed selected features, when evaluated, achieved an average performance accuracy of 98.1%, 98.6%, and 99.9% for testing, validation, and training, respectively. The results show that the CRNN model with (Mel-Spectrogram + MFCC) combination feature achieved the best results when compared to other existing models.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Speaker-based language identification for Ethio-Semitic languages using CRNN and hybrid features

Abstract

Talk to us

Similar Papers

More From: Network: Computation in Neural Systems

Lead the way for us

Journal: Network: Computation in Neural Systems	Publication Date: Jun 6, 2024
Citations: 1

Similar Papers

Identification of Indian classical languages using Convolutional Recurrent Neural Networks
Ibin Oommen ... Anu George
-
Ibin Oommen, et. al.Ibin Oommen ... Anu George
18 Dec 2020
18 Dec 2020

Multilingual Speech Corpus in Low-Resource Eastern and Northeastern Indian Languages for Speaker and Language Identification
Joyanta Basu ... Tapan Kumar Basu
Circuits, Systems, and Signal Processing | VOL. 40
Joyanta Basu, et. al.Joyanta Basu ... Tapan Kumar Basu
20 Apr 2021
Circuits, Systems, and Signal Processing | VOL. 40

Is Attention Always Needed? A Case Study on Language Identification from Speech
Atanu Mandal ... Santanu Pal
SSRN Electronic Journal | VOL. -
Atanu Mandal, et. al.Atanu Mandal ... Santanu Pal
01 Jan 2021
SSRN Electronic Journal | VOL. -

Information Extraction from Product Labels: A Machine Vision Approach
Hansi Seitaj ... Vinayak Elangovan
International Journal of Artificial Intelligence & Applications | VOL. 15
Hansi Seitaj, et. al.Hansi Seitaj ... Vinayak Elangovan
29 Mar 2024
International Journal of Artificial Intelligence & Applications | VOL. 15

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Speaker-based language identification for Ethio-Semitic languages using CRNN and hybrid features

Abstract

Talk to us

Similar Papers

More From: Network: Computation in Neural Systems