Comparison of Various Neural Network Language Models in Speech Recognition

Lingyun Zuo,Jian Liu,Xin Wan

doi:10.1109/icisce.2016.195

Abstract

In recent years, research on language modeling for speech recognition has increasingly focused on the application of neural networks. However, the performance of neural network language models strongly depends on their architectural structure. Three competing concepts have been developed: Firstly, feed forward neural networks representing an n-gram approach, Secondly, recurrent neural networks that may learn context dependencies spanning more than a fixed number of predecessor words, Thirdly, the long short-term memory (LSTM) neural networks can fully exploits the correlation on a telephone conversation corpus. In this paper, we compare count models to feed forward, recurrent, and LSTM neural network in conversational telephone speech recognition tasks. Furthermore, we put forward a language model estimation method introduced the information of history sentences. We evaluate the models in terms of perplexity and word error rate, experimentally validating the strong correlation of the two quantities, which we find to hold regardless of the underlying type of the language model. The experimental results show that the performance of LSTM neural network language model is optimal in n-best lists re-score. Compared to the first pass decoding, the relative decline in average word error rate is 4.3% when using ten candidate results to re-score in conversational telephone speech recognition tasks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Comparison of Various Neural Network Language Models in Speech Recognition

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Bidirectional recurrent neural network language models for automatic speech recognition
Ebru Arisoy ... Stanley Chen
-
Ebru Arisoy, et. al.Ebru Arisoy ... Stanley Chen
01 Apr 2015
01 Apr 2015

Lattice rescoring strategies for long short term memory language models in speech recognition
Shankar Kumar ... Michael Nirschl
-
Shankar Kumar, et. al.Shankar Kumar ... Michael Nirschl
01 Dec 2017
01 Dec 2017

Investigating Bidirectional Recurrent Neural Network Language Models for Speech Recognition
X Chen ... X Liu
-
X Chen, et. al.X Chen ... X Liu
20 Aug 2017
20 Aug 2017

Minimum word error training of long short-term memory recurrent neural network language models for speech recognition
Takaaki Hori ... Chiori Hori
-
Takaaki Hori, et. al.Takaaki Hori ... Chiori Hori
01 Mar 2016
01 Mar 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Comparison of Various Neural Network Language Models in Speech Recognition

Abstract

Talk to us

Similar Papers