Query Classification using Wikipedia’s Category Graph

Milad Alemzadeh,Fakhri Karray,Richard Khoury

doi:10.4304/jetwi.4.3.207-220

Query Classification using Wikipedia’s Category Graph

Milad Alemzadeh, Fakhri Karray + Show 1 more

https://doi.org/10.4304/jetwi.4.3.207-220

Copy DOI

Journal: Journal of Emerging Technologies in Web Intelligence	Publication Date: Aug 1, 2012
Citations: 26

Affiliation: University of Waterloo, Lakehead University

#Wikipedia's Category Graph #User-specified Keywords + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Wikipedia's category graph is a network of 300,000 interconnected category labels, and can be a powerful resource for many classification tasks. However, its size and the lack of order can make it difficult to navigate. In this paper, we present a new algorithm to efficiently exploit this graph and accurately rank classification labels given user-specified keywords. We highlight multiple possible variations of this algorithm, and study the impact of these variations on the classification results in order to determine the optimal way to exploit the category graph. We implement our algorithm as the core of a query classification system and demonstrate its reliability using the KDD CUP 2005 and TREC 2007 competitions as benchmarks.

Full Text