Abstract
The relevance of research on automatic speech recognition (ASR) for low-resource languages stems from the scarcity of prior studies and training data and from the need for new technologies that improve efficiency and performance. The purpose of this work was to study the main aspects of integrated end-to-end speech recognition and the use of modern technologies in the natural language processing of agglutinative languages, including Kazakh. Language models were studied using a combination of comparative, graphical, statistical, and analytical-synthetic methods. The article addresses ASR for agglutinative languages, particularly Kazakh, through a unified neural network model that integrates acoustic and language modeling. Employing techniques such as connectionist temporal classification (CTC) and attention mechanisms, the study focuses on effective speech-to-text transcription for languages with complex morphology. Transfer learning from high-resource languages helps mitigate data scarcity in languages such as Kazakh, Kyrgyz, Uzbek, Turkish, and Azerbaijani. The research assesses model performance, highlights ASR challenges, and proposes advances for these languages. It includes a comparative analysis of phonetic and word-formation features of agglutinative Turkic languages, supported by statistical data. The findings support further research in linguistics and language technology aimed at improving speech recognition and synthesis, and contribute to voice identification and automation.
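To make the end-to-end approach named above concrete, the following is a minimal PyTorch sketch of a joint CTC/attention training objective of the kind commonly used in such unified models. It is an illustration under stated assumptions rather than the authors' implementation: the function name hybrid_loss, the vocabulary size, the tensor layouts, and the 0.3 interpolation weight are placeholders chosen for the example.

    # Minimal sketch of a joint CTC/attention objective for end-to-end ASR.
    # Assumed, illustrative values: vocab_size, blank_id, ctc_weight, tensor shapes.
    import torch
    import torch.nn as nn

    vocab_size = 64      # placeholder size of a Kazakh grapheme/subword inventory
    blank_id = 0         # CTC blank symbol

    ctc_criterion = nn.CTCLoss(blank=blank_id, zero_infinity=True)
    att_criterion = nn.CrossEntropyLoss(ignore_index=-100)

    def hybrid_loss(log_probs, encoder_lengths, targets, target_lengths,
                    decoder_logits, decoder_targets, ctc_weight=0.3):
        """Interpolate the CTC alignment loss with the attention-decoder loss."""
        # log_probs: (T, N, vocab_size) log-softmax outputs over the shared encoder frames
        l_ctc = ctc_criterion(log_probs, targets, encoder_lengths, target_lengths)
        # decoder_logits: (N, U, vocab_size) predictions of the attention decoder
        l_att = att_criterion(decoder_logits.reshape(-1, vocab_size),
                              decoder_targets.reshape(-1))
        return ctc_weight * l_ctc + (1.0 - ctc_weight) * l_att

Interpolating the two losses lets the monotonic CTC alignment regularize the more flexible attention decoder, which is particularly helpful when training data for agglutinative low-resource languages is limited.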