Abstract

Recurrent neural network language models (RNNLMs) are becoming increasingly popular for speech recognition. Previously, we have shown that RNNLMs with a full (non-classed) output layer (F-RNNLMs) can be trained efficiently on a GPU, giving a large reduction in training time over conventional class-based models (C-RNNLMs) trained on a standard CPU. However, since test-time RNNLM evaluation is often performed entirely on a CPU, standard F-RNNLMs are inefficient at test time because the entire output layer must be computed for normalisation. In this paper, it is demonstrated that C-RNNLMs can be efficiently trained on a GPU using our spliced sentence bunch technique, which allows good CPU test-time performance (a 42× speedup over F-RNNLMs). Furthermore, the performance of different classing approaches is investigated. We also examine the use of variance regularisation of the softmax denominator for F-RNNLMs and show that it allows F-RNNLMs to be used efficiently in test (a 56× speedup on a CPU). Finally, the use of two GPUs for F-RNNLM training using pipelining is described and shown to reduce training time over a single GPU by a factor of 1.6×.
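As a brief illustration of the two efficiency mechanisms referred to above, the following is a sketch of their standard formulations; the symbols c(w) for the class of word w, Z_i for the softmax denominator at position i, and γ for the regularisation weight are notational assumptions rather than quantities given in this abstract. The class-based factorisation reduces the normalisation cost from the full vocabulary to the class set plus the words within one class:

P(w_i \mid h_i) = P\big(c(w_i) \mid h_i\big)\, P\big(w_i \mid c(w_i), h_i\big)

Variance regularisation instead augments the cross-entropy training criterion with a penalty on the spread of the log softmax denominator, so that at test time Z can be approximated by a constant and only the target word's output activation needs to be evaluated:

J = -\frac{1}{N}\sum_{i=1}^{N} \ln P(w_i \mid h_i) \;+\; \frac{\gamma}{2N}\sum_{i=1}^{N}\big(\ln Z_i - \overline{\ln Z}\big)^2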
