Continuous space language models

Holger Schwenk

doi:10.1016/j.csl.2006.09.003

Abstract

This paper describes the use of a neural network language model for large vocabulary continuous speech recognition. The underlying idea of this approach is to attack the data sparseness problem by performing the language model probability estimation in a continuous space. Highly efficient learning algorithms are described that enable the use of training corpora of several hundred million words. It is also shown that this approach can be incorporated into a large vocabulary continuous speech recognizer using a lattice rescoring framework at a very low additional processing time. The neural network language model was thoroughly evaluated in a state-of-the-art large vocabulary continuous speech recognizer for several international benchmark tasks, in particular the N ist evaluations on broadcast news and conversational speech recognition. The new approach is compared to four-gram back-off language models trained with modified Kneser–Ney smoothing which has often been reported to be the best known smoothing method. Usually the neural network language model is interpolated with the back-off language model. In that way, consistent word error rate reductions for all considered tasks and languages were achieved, ranging from 0.4% to almost 1% absolute.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Continuous space language models

Abstract

Talk to us

Similar Papers

More From: Computer Speech & Language

Lead the way for us

Journal: Computer Speech & Language	Publication Date: Oct 9, 2006
Citations: 510

Similar Papers

Converting Neural Network Language Models into Back-off Language Models for Efficient Decoding in Automatic Speech Recognition
Ebru Arisoy ... Abhinav Sethy
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 22
Ebru Arisoy, et. al.Ebru Arisoy ... Abhinav Sethy
01 Jan 2014
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 22

Converting Neural Network Language Models into back-off language models for efficient decoding in automatic speech recognition
Ebru Arisoy ... Abhinav Sethy
-
Ebru Arisoy, et. al.Ebru Arisoy ... Abhinav Sethy
01 May 2013
01 May 2013

Comparison of Various Neural Network Language Models in Speech Recognition
Lingyun Zuo ... Jian Liu
-
Lingyun Zuo, et. al.Lingyun Zuo ... Jian Liu
01 Jul 2016
01 Jul 2016

Investigating Bidirectional Recurrent Neural Network Language Models for Speech Recognition
X Chen ... A Ragni
-
X Chen, et. al.X Chen ... A Ragni
20 Aug 2017
20 Aug 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Continuous space language models

Abstract

Talk to us

Similar Papers

More From: Computer Speech &amp; Language

More From: Computer Speech & Language