Scalable Multi Corpora Neural Language Models for ASR

Anirudh Raju,Guitang Lan,Ariya Rastrow,Gautam Tiwari,Denis Filimonov

doi:10.21437/interspeech.2019-3060

Scalable Multi Corpora Neural Language Models for ASR

Anirudh Raju, Guitang Lan + Show 3 more

Open Access

https://doi.org/10.21437/interspeech.2019-3060

Copy DOI

Publication Date: Sep 15, 2019

Citations: 19

#Neural Language Models #Relative WER Reduction + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Neural language models (NLM) have been shown to outperform conventional n-gram language models by a substantial margin in Automatic Speech Recognition (ASR) and other tasks. There are, however, a number of challenges that need to be addressed for an NLM to be used in a practical large-scale ASR system. In this paper, we present solutions to some of the challenges, including training NLM from heterogenous corpora, limiting latency impact and handling personalized bias in the second-pass rescorer. Overall, we show that we can achieve a 6.2% relative WER reduction using neural LM in a second-pass n-best rescoring framework with a minimal increase in latency.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.