Stochastic automata for language modeling

Giuseppe Riccardi,Roberto Pieraccini,Enrico Bocchieri

doi:10.1006/csla.1996.0014

Abstract

Stochastic language models are widely used in spoken language understanding to recognize and interpret the speech signal: the speech samples are decoded into word transcriptions by means of acoustic and syntactic models and then interpreted according to a semantic model. Both for speech recognition and understanding, search algorithms use stochastic models to extract the most likely uttered sentence and its correspondent interpretation. The design of the language models has to be effective in order to mostly constrain the search algorithms and has to be efficient to comply with the storage space limits. In this work we present the Variable N-gram Stochastic Automaton (VNSA) language model that provides a unified formalism for building a wide class of language models. First, this approach allows for the use of accurate language models for large vocabulary speech recognition by using the standard search algorithm in the one-pass Viterbi decoder. Second, the unified formalism is an effective approach to incorporate different sources of information for computing the probability of word sequences. Third, the VNSAs are well suited for those applications where speech and language decoding cascades are implemented through weighted rational transductions. The VNSAs have been compared to standard bigram and trigram language models and their reduced set of parameters does not affect by any means the performances in terms of perplexity. The design of a stochastic language model through the VNSA is described and applied to word and phrase class-based language models. The effectiveness of VNSAs has been tested within the Air Travel Information System (ATIS) task to build the language model for the speech recognition and the language understanding system.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Stochastic automata for language modeling

Abstract

Talk to us

Similar Papers

More From: Computer Speech & Language

Lead the way for us

Journal: Computer Speech & Language	Publication Date: Oct 1, 1996
Citations: 99

Similar Papers

Investigating Bidirectional Recurrent Neural Network Language Models for Speech Recognition
X Chen ... X Liu
-
X Chen, et. al.X Chen ... X Liu
20 Aug 2017
20 Aug 2017

Dynamic out-of-vocabulary word registration to language model for speech recognition
Norihide Kitaoka ... Bohan Chen
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2021
Norihide Kitaoka, et. al.Norihide Kitaoka ... Bohan Chen
25 Jan 2021
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2021

Bidirectional recurrent neural network language models for automatic speech recognition
Ebru Arisoy ... Stanley Chen
-
Ebru Arisoy, et. al.Ebru Arisoy ... Stanley Chen
01 Apr 2015
01 Apr 2015

Minimum word error training of long short-term memory recurrent neural network language models for speech recognition
Takaaki Hori ... Chiori Hori
-
Takaaki Hori, et. al.Takaaki Hori ... Chiori Hori
01 Mar 2016
01 Mar 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Stochastic automata for language modeling

Abstract

Talk to us

Similar Papers

More From: Computer Speech & Language