Multilingual end-to-end ASR for low-resource Turkic languages with common alphabets

Akbayan Bekarystankyzy, Orken Mamyrbayev, Mateus Mendes, Anar Fazylzhanova, Muhammad Assam

doi:10.1038/s41598-024-64848-1

Abstract

Training a reliable and accurate automatic speech recognition (ASR) model requires a sufficient amount of transcribed audio data. Many of the world's languages, especially the agglutinative languages of the Turkic family, lack this type of data. Many studies have sought better models for low-resource languages using different approaches, the most popular of which are multilingual training and transfer learning. In this study, we combined five agglutinative languages from the Turkic family (Kazakh, Bashkir, Kyrgyz, Sakha, and Tatar) for multilingual training using connectionist temporal classification and an attention mechanism, together with a language model, because these languages share cognate words, sentence formation rules, and an alphabet (Cyrillic). Data from the open-source Common Voice database were used to make the experiments reproducible. The experiments showed that multilingual training can improve ASR performance for all languages included in the experiment except Bashkir. A dramatic result was achieved for Kyrgyz: the word error rate decreased to nearly one-fifth and the character error rate decreased to one-fourth, which shows that this approach can be helpful for critically low-resource languages.
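The two metrics reported in the abstract, word error rate (WER) and character error rate (CER), are both conventionally defined as an edit distance between the reference transcript and the recognizer output, divided by the reference length. The following is a minimal sketch of that standard definition, not the evaluation code used in the study. The function names and the example sentence are illustrative assumptions, and this sketch counts spaces as characters when computing CER, which is one of several common conventions.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences
    (minimum number of substitutions, insertions, and deletions)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # delete a reference token
                curr[j - 1] + 1,     # insert a hypothesis token
                prev[j - 1] + cost,  # substitute, or match at no cost
            ))
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate: word-level edit distance / number of reference words."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate: character-level edit distance / reference length.
    Assumption: spaces are counted as characters."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(list(reference), list(hypothesis)) / len(reference)


if __name__ == "__main__":
    # Illustrative sentence pair, not taken from the paper's data.
    reference = "мен үйге бардым"
    hypothesis = "мен үйге барды"
    print(f"WER: {wer(reference, hypothesis):.3f}")
    print(f"CER: {cer(reference, hypothesis):.3f}")
```

In this example one of the three reference words is wrong, so the WER is 1/3, while only one of the 15 reference characters is missing, so the CER is 1/15. This gap between the two metrics is typical for agglutinative languages, where a single wrong suffix makes the whole word count as an error.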

Journal: Scientific Reports
Publication Date: Jun 15, 2024
License type: CC BY
