End-to-End Multilingual Automatic Speech Recognition for Less-Resourced Languages: The Case of Four Ethiopian Languages

Solomon Teferra Abate,Martha Yifiru Tachbelie,Tanja Schultz

doi:10.1109/icassp39728.2021.9415020

Abstract

The End-to-End (E2E) approach, which maps a sequence of input features into a sequence of graphemes or words, to Automatic Speech Recognition (ASR) is a hot research agenda. It is interesting for less-resourced languages since it avoids the use of pronunciation dictionary, which is one of the major components in the traditional ASR systems. However, like any deep neural network (DNN) approaches, E2E is data greedy. This makes the application of E2E to less-resourced languages questionable. However, using data from other languages in a multilingual (ML) setup is being applied to solve the problem of data scarcity. We have, therefore, conducted ML E2E ASR experiments for four less-resourced Ethiopian languages using different language and acoustic modelling units. The results of our experiments show that relative Word Error Rate (WER) reductions (over the monolingual E2E systems) of up to 29.83% can be achieved by just using data of two related languages in E2E ASR system training. Moreover, we have also noticed that the use of data from less related languages also leads to E2E ASR performance improvement over the use of monolingual data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

End-to-End Multilingual Automatic Speech Recognition for Less-Resourced Languages: The Case of Four Ethiopian Languages

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

DNN-based Multilingual Acoustic Modeling for Four Ethiopian Languages
Solomon Teferra ... Martha Yifiru
SINET: Ethiopian Journal of Science | VOL. 46
Solomon Teferra, et. al.Solomon Teferra ... Martha Yifiru
27 Mar 2024
SINET: Ethiopian Journal of Science | VOL. 46

Multilingual speech recognition for GlobalPhone languages
Martha Yifiru Tachbelie ... Tanja Schultz
Speech Communication | VOL. 140
Martha Yifiru Tachbelie, et. al.Martha Yifiru Tachbelie ... Tanja Schultz
26 Mar 2022
Speech Communication | VOL. 140

Theoretical Analysis of Diversity in an Ensemble of Automatic Speech Recognition Systems
Kartik Audhkhasi ... Shrikanth S Narayanan
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 22
Kartik Audhkhasi, et. al.Kartik Audhkhasi ... Shrikanth S Narayanan
01 Mar 2014
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 22

Subband Temporal Envelope Features and Data Augmentation for End-to-end Recognition of Distant Conversational Speech
Cong-Thanh Do
-
Cong-Thanh DoCong-Thanh Do
01 May 2019
01 May 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

End-to-End Multilingual Automatic Speech Recognition for Less-Resourced Languages: The Case of Four Ethiopian Languages

Abstract

Talk to us

Similar Papers