Efficient Training and Evaluation of Recurrent Neural Network Language Models for Automatic Speech Recognition

Xie Chen,Mark J F Gales,Yongqiang Wang,Xunying Liu,Philip C Woodland

doi:10.1109/taslp.2016.2598304

Abstract

Recurrent neural network language models RNNLMs are becoming increasingly popular for a range of applications including automatic speech recognition. An important issue that limits their possible application areas is the computational cost incurred in training and evaluation. This paper describes a series of new efficiency improving approaches that allows RNNLMs to be more efficiently trained on graphics processing units GPUs and evaluated on CPUs. First, a modified RNNLM architecture with a nonclass-based, full output layer structure F-RNNLM is proposed. This modified architecture facilitates a novel spliced sentence bunch mode parallelization of F-RNNLM training using large quantities of data on a GPU. Second, two efficient RNNLM training criteria based on variance regularization and noise contrastive estimation are explored to specifically reduce the computation associated with the RNNLM output layer softmax normalisation term. Finally, a pipelined training algorithm utilizing multiple GPUs is also used to further improve the training speed. Initially, RNNLMs were trained on a moderate dataset with 20M words from a large vocabulary conversational telephone speech recognition task. The training time of RNNLM is reduced by up to a factor of 53 on a single GPU over the standard CPU-based RNNLM toolkit. A 56 times speed up in test time evaluation on a CPU was obtained over the baseline F-RNNLMs. Consistent improvements in both recognition accuracy and perplexity were also obtained over C-RNNLMs. Experiments on Google's one billion corpus also reveals that the training of RNNLM scales well.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE/ACM Transactions on Audio, Speech, and Language Processing	Publication Date: Nov 1, 2016
Citations: 60	License type: mit

R Discovery Prime

R Discovery Prime

Efficient Training and Evaluation of Recurrent Neural Network Language Models for Automatic Speech Recognition

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Audio, Speech, and Language Processing

Lead the way for us

Similar Papers

Efficient GPU-based training of recurrent neural network language models using spliced sentence bunch
X Chen ... Mark J F Gales
-
X Chen, et. al.X Chen ... Mark J F Gales
14 Sep 2014
14 Sep 2014

Recurrent neural network language model training with noise contrastive estimation for speech recognition
X Chen ... X Liu
-
X Chen, et. al.X Chen ... X Liu
01 Apr 2015
01 Apr 2015

Joint unsupervised adaptation of n-gram and RNN language models via LDA-based hybrid mixture modeling
Ryo Masumura ... Taichi Asami
-
Ryo Masumura, et. al.Ryo Masumura ... Taichi Asami
01 Dec 2017
01 Dec 2017

Improving the training and evaluation efficiency of recurrent neural network language models
X Chen ... M.J.F Gales
-
X Chen, et. al.X Chen ... M.J.F Gales
01 Apr 2015
01 Apr 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Efficient Training and Evaluation of Recurrent Neural Network Language Models for Automatic Speech Recognition

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Audio, Speech, and Language Processing