Parallel Information Retrieval with Query Expansion

Yoojin Chung

doi:10.1007/3-540-48051-x_20

Abstract

An information retrieval (IR) system with query expansion on a low-cost high-performance PC cluster environment is implemented. The IR system stores document sets, it is indexed by the inverted-index-file (IIF), and the vector space model is used as ranking strategy. The query expansion is adding terms into the original query for raising retrieval effectiveness. In this work, the query expansion with the collocation-based similarity measure is used. In our parallel IR system, the inverted-index file (IIF) is partitioned into pieces using the lexical and the greedy declustering methods. For each incoming user's query withm ultiple terms after query expansion, terms are sent to the corresponding nodes that contain the relevant pieces of the IIF to be evaluated in parallel. We study how query performance is affected by query expansion and two declustering methods using two standard Korean test collections. According to the experiments, the greedy method shows about 20% enhancement overall when compared with the lexical method.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Parallel Information Retrieval with Query Expansion

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Declustering Web Content Indices for Parallel Information Retrieval
Yoojin Chung ... Kwang Ryel Ryu
-
Yoojin Chung, et. al.Yoojin Chung ... Kwang Ryel Ryu
01 Jan 2001
01 Jan 2001

Co-occurrence based predictors for estimating query difficulty
Hazra Imran ... Aditi Sharan
-
Hazra Imran, et. al.Hazra Imran ... Aditi Sharan
01 Dec 2010
01 Dec 2010

Information Retrieval on an SCI-Based PC Cluster
Sang-Hwa Chung ... Hyuk-Chul Kwon
The Journal of Supercomputing | VOL. 19
Sang-Hwa Chung, et. al.Sang-Hwa Chung ... Hyuk-Chul Kwon
01 Jan 2001
The Journal of Supercomputing | VOL. 19

Parallel Information Retrieval on an SCI-Based PC-NOW
Sang-Hwa Chung ... Jin-Hyuk Kim
-
Sang-Hwa Chung, et. al.Sang-Hwa Chung ... Jin-Hyuk Kim
01 Jan 1999
01 Jan 1999

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Parallel Information Retrieval with Query Expansion

Abstract

Talk to us

Similar Papers