Improved mixed language speech recognition using asymmetric acoustic model and language model with code-switch inversion constraints

Ying Li,Pascale Fung

doi:10.1109/icassp.2013.6639094

Abstract

We propose an integrated framework for large vocabulary continuous mixed language speech recognition that handles the accent effect in the bilingual acoustic model and the inversion constraint well known to linguists in the language model. Our asymmetric acoustic model with phone set extension improves upon previous work by striking a balance between data and phonetic knowledge. Our language model improves upon previous work by (1) using the inversion constraint to predict code switching points in the mixed language and (2) integrating a code-switch prediction model, a translation model and a reconstruction model together. This integration means that our language model avoids the pitfall of propagated error that could arise from decoupling these steps. Finally, a WFST-based decoder integrates the acoustic models, code-switch language model and a monolingual language model in the matrix language all together. Our system reduces word error rate by 1.88% on a lecture speech corpus and by 2.43% on a lunch conversation corpus, with statistical significance, over the conventional bilingual acoustic model and interpolated language model.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Improved mixed language speech recognition using asymmetric acoustic model and language model with code-switch inversion constraints

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Code switch language modeling with Functional Head Constraint
Ying Li ... Pascale Fung
-
Ying Li, et. al.Ying Li ... Pascale Fung
01 May 2014
01 May 2014

Code-Switching Detection with Data-Augmented Acoustic and Language Models
Emre Yilmaz ... Henk Van Den Heuvel
-
Emre Yilmaz, et. al.Emre Yilmaz ... Henk Van Den Heuvel
29 Aug 2018
29 Aug 2018

Acoustic and Textual Data Augmentation for Improved ASR of Code-Switching Speech
Emre Yılmaz ... Henk Van Den Heuvel
-
Emre Yılmaz, et. al.Emre Yılmaz ... Henk Van Den Heuvel
02 Sep 2018
02 Sep 2018

Using different acoustic, lexical and language modeling units for ASR of an under-resourced language – Amharic
Martha Yifiru Tachbelie ... Laurent Besacier
Speech Communication | VOL. 56
Martha Yifiru Tachbelie, et. al.Martha Yifiru Tachbelie ... Laurent Besacier
14 Feb 2013
Speech Communication | VOL. 56

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improved mixed language speech recognition using asymmetric acoustic model and language model with code-switch inversion constraints

Abstract

Talk to us

Similar Papers