Abstract
This paper proposes a novel Language Model (LM) adaptation method based on Minimum Discrimination Information (MDI). In the proposed method, a background LM is viewed as a discrete distribution, and an adapted LM is built to be as close as possible to the background LM while satisfying a unigram constraint. The method targets settings where only a limited amount of domain corpus is available for adapting a natural-language-based intelligent personal assistant system. Two unigram-constraint estimation methods are proposed: one based on word frequencies in the domain corpus, and one based on word similarities estimated from WordNet. In terms of the adapted LM's perplexity, using word frequencies from tiny domain corpora (30~120 seconds in length) yields relative improvements of 13.9%~16.6%. Further relative improvements (1.5%~2.4%) are observed when WordNet is used to generate word similarities. These results demonstrate an efficient way of re-scaling and normalizing the conditional distribution, compared with an interpolation-based LM.
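The core re-scaling step described above can be illustrated with a minimal sketch. Under MDI adaptation with a unigram constraint, each background conditional probability is multiplied by a unigram ratio and then renormalized per history. The function name `mdi_adapt`, the dictionary-based LM representation, the damping exponent `beta`, and the uniform history weighting used to estimate the background unigram marginal are all illustrative assumptions, not the paper's exact formulation:

```python
def mdi_adapt(background, domain_unigram, beta=0.5):
    """Rescale a background conditional LM by unigram ratios, then renormalize.

    background: dict mapping history -> {word: P_bg(word | history)}
    domain_unigram: dict mapping word -> P_domain(word), estimated from the
        (tiny) domain corpus or from WordNet-based word similarities
    beta: damping exponent on the unigram ratio (assumed hyperparameter)
    """
    # Estimate the background unigram marginal; for this sketch we weight
    # all histories uniformly (an assumption, not part of the method).
    bg_unigram = {}
    for dist in background.values():
        for w, p in dist.items():
            bg_unigram[w] = bg_unigram.get(w, 0.0) + p
    total = sum(bg_unigram.values())
    bg_unigram = {w: p / total for w, p in bg_unigram.items()}

    adapted = {}
    for h, dist in background.items():
        # Scale each conditional by the damped ratio alpha(w)^beta,
        # where alpha(w) = P_domain(w) / P_bg(w); unseen words keep alpha = 1.
        scaled = {
            w: p * (domain_unigram.get(w, bg_unigram[w]) / bg_unigram[w]) ** beta
            for w, p in dist.items()
        }
        z = sum(scaled.values())  # per-history normalizer Z(h)
        adapted[h] = {w: p / z for w, p in scaled.items()}
    return adapted
```

Because the scaling factor depends only on the word (not the history), the adapted model stays close to the background LM in the MDI sense while shifting probability mass toward domain-frequent words; the per-history normalization keeps each conditional a valid distribution.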