QWI: a method for improved smoothing in language modelling

G Bordel,I Torres,E Vidal

doi:10.1109/icassp.1995.479395

Abstract

N-grams have been extensively and successfully used for language modelling in continuous speech recognition tasks. On the other hand, it has been shown that k-testable stochastic languages (k-TS) are strictly equivalent to N-grams. A major problem to be solved when using a language model is the estimation of the probabilities of events not represented in the training corpus, i.e. unseen events. The aim of this work is to improve other well established smoothing procedures by interpolating models with different levels of complexity (quality weighted interpolation-QWI). The effect of QWI was experimentally evaluated over a set of back-off smoothed k-TS language models. These experiments were carried out over several corpora using the test-set perplexity as an evaluation criterion. In all the cases the introduction of QWI resulted in a reduction of the test-set perplexity.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

QWI: a method for improved smoothing in language modelling

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Keyword-spotting using SRI's DECIPHER large-vocabulary speech-recognition system
M Weintraub
-
M WeintraubM Weintraub
01 Jan 1992
01 Jan 1992

Leveraging relevance cues for language modeling in speech recognition
Berlin Chen ... Kuan-Yu Chen
Information Processing and Management | VOL. 49
Berlin Chen, et. al.Berlin Chen ... Kuan-Yu Chen
28 Feb 2013
Information Processing and Management | VOL. 49

Acquisition of language models based on HMnet
Motoyuki Suzuki ... Hirotomo Aso
The Journal of the Acoustical Society of America | VOL. 100
Motoyuki Suzuki, et. al.Motoyuki Suzuki ... Hirotomo Aso
01 Oct 1996
The Journal of the Acoustical Society of America | VOL. 100

Mixture Probabilistic Context-Free Grammar
Kenji Kita
Journal of Natural Language Processing | VOL. 3
Kenji KitaKenji Kita
01 Jan 1996
Journal of Natural Language Processing | VOL. 3

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

QWI: a method for improved smoothing in language modelling

Abstract

Talk to us

Similar Papers