Re-ranking search results using language models of query-specific clusters

Oren Kurland

doi:10.1007/s10791-008-9065-9

Abstract

To obtain high precision at top ranks by a search performed in response to a query, researchers have proposed a cluster-based re-ranking paradigm: clustering an initial list of documents that are the most highly ranked by some initial search, and using information induced from these (often called) query-specific clusters for re-ranking the list. However, results concerning the effectiveness of various automatic cluster-based re-ranking methods have been inconclusive. We show that using query-specific clusters for automatic re-ranking of top-retrieved documents is effective with several methods in which clusters play different roles, among which is the smoothing of document language models. We do so by adapting previously-proposed cluster-based retrieval approaches, which are based on (static) query-independent clusters for ranking all documents in a corpus, to the re-ranking setting wherein clusters are query-specific. The best performing method that we develop outperforms both the initial document-based ranking and some previously proposed cluster-based re-ranking approaches; furthermore, this algorithm consistently outperforms a state-of-the-art pseudo-feedback-based approach. In further exploration we study the performance of cluster-based smoothing methods for re-ranking with various (soft and hard) clustering algorithms, and demonstrate the importance of clusters in providing context from the initial list through a comparison to using single documents to this end.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Re-ranking search results using language models of query-specific clusters

Abstract

Talk to us

Similar Papers

More From: Information Retrieval

Lead the way for us

Journal: Information Retrieval	Publication Date: Jul 10, 2008
Citations: 55

Similar Papers

Soft and hard clustering of geoelectrical data for detection of leachate accumulation zones in municipal solid waste landfills
Davide Melegari ... Giorgio De Donno
-
Davide Melegari, et. al.Davide Melegari ... Giorgio De Donno
08 Mar 2024
08 Mar 2024

Different Schemes for Improving Fuzzy Clustering Through Supervised Learning
Anup Kumar Mallick ... Anirban Mukhopadhyay
-
Anup Kumar Mallick, et. al.Anup Kumar Mallick ... Anirban Mukhopadhyay
01 Jan 2019
01 Jan 2019

EEW-SC: Enhanced Entropy-Weighting Subspace Clustering for high dimensional gene expression data clustering analysis
Zhaohong Deng ... Shitong Wang
Applied Soft Computing Journal | VOL. 11
Zhaohong Deng, et. al.Zhaohong Deng ... Shitong Wang
24 Jul 2011
Applied Soft Computing Journal | VOL. 11

Soft Short-Text Clustering using PageRank as a Centrality Measure
Khaled Abdalgader
-
Khaled AbdalgaderKhaled Abdalgader
24 Feb 2017
24 Feb 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Re-ranking search results using language models of query-specific clusters

Abstract

Talk to us

Similar Papers

More From: Information Retrieval