Neural candidate-aware language models for speech recognition

Tomohiro Tanaka,Ryo Masumura,Takanobu Oba

doi:10.1016/j.csl.2020.101157

Abstract

This paper presents novel neural network based language models that can correct automatic speech recognition (ASR) errors by using speech recognizer outputs as a context. Our proposed models, called neural candidate-aware language models (NCALMs), estimate the generative probability of a target sentence while considering ASR outputs including hypotheses and their posterior probabilities. Recently, neural network language models have achieved great success in ASR field because of their ability to learn long-range contexts and model the word representation in continuous space. However, they estimate a sentence probability without considering other candidates and their posterior probabilities, even though the competing hypotheses are available and include important information to increase the speech recognition accuracy. To overcome this limitation, our idea is to utilize ASR outputs in both the training phase and the inference phase. Our proposed models are conditional generative models consisting of a Transformer encoder and a Transformer decoder. The encoder embeds the candidates as context vectors and the decoder estimates a sentence probability given the context vectors. We evaluate the proposed models in Japanese lecture transcription and English conversational speech recognition tasks. Experimental results show that a NCALM has better ASR performance than a system including a deep neural network-hidden Markov model hybrid system. We further improve ASR performance by using a NCALM and a Transformer language model simultaneously.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Neural candidate-aware language models for speech recognition

Abstract

Talk to us

Similar Papers

More From: Computer Speech & Language

Lead the way for us

Journal: Computer Speech & Language	Publication Date: Sep 24, 2020
Citations: 3

Similar Papers

Investigating Bidirectional Recurrent Neural Network Language Models for Speech Recognition
X Chen ... X Liu
-
X Chen, et. al.X Chen ... X Liu
20 Aug 2017
20 Aug 2017

Bag-of-words input for long history representation in neural network-based language models for speech recognition
Kazuki Irie ... Ralf Schlüter
-
Kazuki Irie, et. al.Kazuki Irie ... Ralf Schlüter
06 Sep 2015
06 Sep 2015

Improving ASR Error Detection with RNNLM Adaptation
Rahhal Errattahi ... Hassan Ouahmane
-
Rahhal Errattahi, et. al.Rahhal Errattahi ... Hassan Ouahmane
01 Dec 2018
01 Dec 2018

Comparison of Various Neural Network Language Models in Speech Recognition
Lingyun Zuo ... Xin Wan
-
Lingyun Zuo, et. al.Lingyun Zuo ... Xin Wan
01 Jul 2016
01 Jul 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Neural candidate-aware language models for speech recognition

Abstract

Talk to us

Similar Papers

More From: Computer Speech &amp; Language

More From: Computer Speech & Language