Cocluster hypothesis and ranking consistency for relevance ranking in web search

Jian‐De Jiang,Jyun‐Yu Jiang,Pu‐Jen Cheng

doi:10.1002/asi.24071

Abstract

Conventional approaches to relevance ranking typically optimize ranking models by each query separately. The traditional cluster hypothesis also does not consider the dependency between related queries. The goal of this paper is to leverage similar search intents to perform ranking consistency so that the search performance can be improved accordingly. Different from the previous supervised approach, which learns relevance by click‐through data, we propose a novel cocluster hypothesis to bridge the gap between relevance ranking and ranking consistency. A nearest‐neighbors test is also designed to measure the extent to which the cocluster hypothesis holds. Based on the hypothesis, we further propose a two‐stage unsupervised approach, in which two ranking heuristics and a cost function are developed to optimize the combination of consistency and uniqueness (or inconsistency). Extensive experiments have been conducted on a real and large‐scale search engine log. The experimental results not only verify the applicability of the proposed cocluster hypothesis but also show that our approach is effective in boosting the retrieval performance of the commercial search engine and reaches a comparable performance to the supervised approach.

Full Text