Dual Script E2E Framework for Multilingual and Code-Switching ASR

Mari Ganesh Kumar,Lodagala V.S.V Durga Prasad,Saish Jaiswal,Anusha Prakash,Ashish Seth,Hema A Murthy,Anand Thyagachandran,Jom Kuriakose,Arun Kumar A

doi:10.21437/interspeech.2021-978

Abstract

India is home to multiple languages, and training automatic speech recognition (ASR) systems for languages is challenging. Over time, each language has adopted words from other languages, such as English, leading to code-mixing. Most Indian languages also have their own unique scripts, which poses a major limitation in training multilingual and code-switching ASR systems. Inspired by results in text-to-speech synthesis, in this work, we use an in-house rule-based phoneme-level common label set (CLS) representation to train multilingual and code-switching ASR for Indian languages. We propose two end-to-end (E2E) ASR systems. In the first system, the E2E model is trained on the CLS representation, and we use a novel data-driven back-end to recover the native language script. In the second system, we propose a modification to the E2E model, wherein the CLS representation and the native language characters are used simultaneously for training. We show our results on the multilingual and code-switching tasks of the Indic ASR Challenge 2021. Our best results achieve 6% and 5% improvement (approx) in word error rate over the baseline system for the multilingual and code-switching tasks, respectively, on the challenge development data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Dual Script E2E Framework for Multilingual and Code-Switching ASR

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

An Investigation of Multilingual TDNN-BLSTM Acoustic Modeling for Hindi Speech Recognition
Ankit Kumar ... Rajesh Kumar Aggarwal
International Journal of Sensors, Wireless Communications and Control | VOL. 12
Ankit Kumar, et. al.Ankit Kumar ... Rajesh Kumar Aggarwal
01 Jan 2021
International Journal of Sensors, Wireless Communications and Control | VOL. 12

Dynamic Pronunciation Modelling for Unsupervised Learning of ASR Systems
Akella Amarendra Babu ... Ananda Rao Akepogu
IETE Journal of Research | VOL. 62
Akella Amarendra Babu, et. al.Akella Amarendra Babu ... Ananda Rao Akepogu
26 Apr 2016
IETE Journal of Research | VOL. 62

ASR Error Correction and Domain Adaptation Using Machine Translation
Anirudh Mani ... Florian Metze
-
Anirudh Mani, et. al.Anirudh Mani ... Florian Metze
01 May 2020
01 May 2020

Theoretical Analysis of Diversity in an Ensemble of Automatic Speech Recognition Systems
Kartik Audhkhasi ... Shrikanth S Narayanan
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 22
Kartik Audhkhasi, et. al.Kartik Audhkhasi ... Shrikanth S Narayanan
01 Mar 2014
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Dual Script E2E Framework for Multilingual and Code-Switching ASR

Abstract

Talk to us

Similar Papers