Parsimonious translation models for information retrieval

Seung-Hoon Na,In-Su Kang,Jong-Hyeok Lee

doi:10.1016/j.ipm.2006.04.005

Abstract

In the KL divergence framework, the extended language modeling approach has a critical problem of estimating a query model, which is the probabilistic model that encodes the user’s information need. For query expansion in initial retrieval, the translation model had been proposed to involve term co-occurrence statistics. However, the translation model was difficult to apply, because the term co-occurrence statistics must be constructed in the offline time. Especially in a large collection, constructing such a large matrix of term co-occurrences statistics prohibitively increases time and space complexity. In addition, reliable retrieval performance cannot be guaranteed because the translation model may comprise noisy non-topical terms in documents. To resolve these problems, this paper investigates an effective method to construct co-occurrence statistics and eliminate noisy terms by employing a parsimonious translation model. The parsimonious translation model is a compact version of a translation model that can reduce the number of terms containing non-zero probabilities by eliminating non-topical terms in documents. Through experimentation on seven different test collections, we show that the query model estimated from the parsimonious translation model significantly outperforms not only the baseline language modeling, but also the non-parsimonious models.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Parsimonious translation models for information retrieval

Abstract

Talk to us

Similar Papers

More From: Information Processing and Management

Lead the way for us

Journal: Information Processing and Management	Publication Date: Jun 12, 2006
Citations: 25

Similar Papers

Effective Query Model Estimation Using Parsimonious Translation Model in Language Modeling Approach
Seung-Hoon Na ... Ji-Eun Roh
-
Seung-Hoon Na, et. al.Seung-Hoon Na ... Ji-Eun Roh
01 Jan 2004
01 Jan 2004

Estimation of Query Model from Parsimonious Translation Model
Seung-Hoon Na ... Jong-Hyeok Lee
-
Seung-Hoon Na, et. al.Seung-Hoon Na ... Jong-Hyeok Lee
01 Jan 2004
01 Jan 2004

Language modeling for voice search: A machine translation approach
Xiao Li ... Geoffrey Zweig
-
Xiao Li, et. al. Xiao Li ... Geoffrey Zweig
01 Mar 2008
01 Mar 2008

Dynamic Fusion: Attentional Language Model for Neural Machine Translation
Michiki Kurosawa ... Mamoru Komachi
-
Michiki Kurosawa, et. al.Michiki Kurosawa ... Mamoru Komachi
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Parsimonious translation models for information retrieval

Abstract

Talk to us

Similar Papers

More From: Information Processing and Management