Recurrent neural network language model training with noise contrastive estimation for speech recognition

X Chen,P C Woodland,M J F Gales,X Liu

doi:10.1109/icassp.2015.7179005

Abstract

In recent years recurrent neural network language models (RNNLMs) have been successfully applied to a range of tasks including speech recognition. However, an important issue that limits the quantity of data used, and their possible application areas, is the computational cost in training. A signi??cant part of this cost is associated with the softmax function at the output layer, as this requires a normalization term to be explicitly calculated. This impacts both the training and testing speed, especially when a large output vocabulary is used. To address this problem, noise contrastive estimation (NCE) is explored in RNNLM training. NCE does not require the above normalization during both training and testing. It is insensitive to the output layer size. On a large vocabulary conversational telephone speech recognition task, a doubling in training speed on a GPU and a 56 times speed up in test time evaluation on a CPU were obtained.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Recurrent neural network language model training with noise contrastive estimation for speech recognition

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Efficient GPU-based training of recurrent neural network language models using spliced sentence bunch
X Chen ... Mark J F Gales
-
X Chen, et. al.X Chen ... Mark J F Gales
14 Sep 2014
14 Sep 2014

Efficient Training and Evaluation of Recurrent Neural Network Language Models for Automatic Speech Recognition
Xie Chen ... Philip C Woodland
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 24
Xie Chen, et. al.Xie Chen ... Philip C Woodland
01 Nov 2016
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 24

Joint unsupervised adaptation of n-gram and RNN language models via LDA-based hybrid mixture modeling
Ryo Masumura ... Taichi Asami
-
Ryo Masumura, et. al.Ryo Masumura ... Taichi Asami
01 Dec 2017
01 Dec 2017

Training RNN language models on uncertain ASR hypotheses in limited data scenarios
Imran Sheikh ... Irina Illina
Computer Speech & Language | VOL. 83
Imran Sheikh, et. al.Imran Sheikh ... Irina Illina
20 Aug 2023
Computer Speech & Language | VOL. 83

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Recurrent neural network language model training with noise contrastive estimation for speech recognition

Abstract

Talk to us

Similar Papers