Minimum word error training of long short-term memory recurrent neural network language models for speech recognition

Takaaki Hori,Shinji Watanabe,John R. Hershey,Chiori Hori

doi:10.1109/icassp.2016.7472827

Abstract

This paper describes minimum word error (MWE) training of recurrent neural network language models (RNNLMs) for speech recognition. RNNLMs are usually trained to minimize a cross entropy of estimated word probabilities against the correct word sequence, which corresponds to maximum likelihood criterion. However, this training does not necessarily maximize a performance measure in a target task, i.e. it does not minimize word error rate (WER) explicitly in speech recognition. To solve such a problem, several discriminative training methods have already been proposed for n-gram language models, but those for RNNLMs have not sufficiently investigated. In this paper, we propose a MWE training method for RNNLMs, and report significant WER reductions when we applied the MWE method to a standard Elman-type RNNLM and a more advanced model, a Long Short-Term Memory (LSTM) RNNLM. We also present efficient MWE training with N-best lists on Graphics Processing Units (GPUs).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Minimum word error training of long short-term memory recurrent neural network language models for speech recognition

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Bidirectional recurrent neural network language models for automatic speech recognition
Ebru Arisoy ... Stanley Chen
-
Ebru Arisoy, et. al.Ebru Arisoy ... Stanley Chen
01 Apr 2015
01 Apr 2015

Investigating Bidirectional Recurrent Neural Network Language Models for Speech Recognition
X Chen ... X Liu
-
X Chen, et. al.X Chen ... X Liu
20 Aug 2017
20 Aug 2017

Lattice rescoring strategies for long short term memory language models in speech recognition
Shankar Kumar ... Michael Nirschl
-
Shankar Kumar, et. al.Shankar Kumar ... Michael Nirschl
01 Dec 2017
01 Dec 2017

Joint unsupervised adaptation of n-gram and RNN language models via LDA-based hybrid mixture modeling
Ryo Masumura ... Taichi Asami
-
Ryo Masumura, et. al.Ryo Masumura ... Taichi Asami
01 Dec 2017
01 Dec 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Minimum word error training of long short-term memory recurrent neural network language models for speech recognition

Abstract

Talk to us

Similar Papers