Leveraging Native Language Information for Improved Accented Speech Recognition

Shahram Ghorbani,John H.L Hansen

doi:10.21437/interspeech.2018-1378

Abstract

Recognition of accented speech is a long-standing challenge for automatic speech recognition (ASR) systems, given the increasing worldwide population of bi-lingual speakers with English as their second language. If we consider foreign-accented speech as an interpolation of the native language (L1) and English (L2), using a model that can simultaneously address both languages would perform better at the acoustic level for accented speech. In this study, we explore how an end-to-end recurrent neural network (RNN) trained system with English and native languages (Spanish and Indian languages) could leverage data of native languages to improve performance for accented English speech. To this end, we examine pre-training with native languages, as well as multi-task learning (MTL) in which the main task is trained with native English and the secondary task is trained with Spanish or Indian Languages. We show that the proposed MTL model performs better than the pre-training approach and outperforms a baseline model trained simply with English data. We suggest a new setting for MTL in which the secondary task is trained with both English and the native language, using the same output set. This proposed scenario yields better performance with +11.95% and +17.55% character error rate gains over baseline for Hispanic and Indian accents, respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Leveraging Native Language Information for Improved Accented Speech Recognition

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Native Language Identification from Spoken Indian English
...
Trends in Electrical Engineering | VOL. 9
, et. al. ...
30 Oct 2019
Trends in Electrical Engineering | VOL. 9

End-to-End Audiovisual Speech Recognition System With Multitask Learning
Fei Tao ... Carlos Busso
IEEE Transactions on Multimedia | VOL. 23
Fei Tao, et. al.Fei Tao ... Carlos Busso
06 Mar 2020
IEEE Transactions on Multimedia | VOL. 23

An Investigation of Multilingual TDNN-BLSTM Acoustic Modeling for Hindi Speech Recognition
Ankit Kumar ... Rajesh Kumar Aggarwal
International Journal of Sensors, Wireless Communications and Control | VOL. 12
Ankit Kumar, et. al.Ankit Kumar ... Rajesh Kumar Aggarwal
01 Jan 2021
International Journal of Sensors, Wireless Communications and Control | VOL. 12

Dual Script E2E Framework for Multilingual and Code-Switching ASR
Mari Ganesh Kumar ... Arun Kumar A
-
Mari Ganesh Kumar, et. al.Mari Ganesh Kumar ... Arun Kumar A
30 Aug 2021
30 Aug 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Leveraging Native Language Information for Improved Accented Speech Recognition

Abstract

Talk to us

Similar Papers