Improving Information Retrieval System by Co-clustering Web Documents and Queries

Liu Yufeng, Li Renfa

doi:10.4156/aiss.vol3.issue8.32

Abstract

The World Wide Web is considered the most valuable place for information retrieval and knowledge discovery. When retrieving information in response to user queries, a search engine returns a large and unmanageable collection of documents. A more efficient way to organize the documents is to combine clustering and ranking: clustering groups the documents, and ranking orders the pages within each cluster. This paper proposes an approach to co-clustering web documents and queries. When a user issues a query, we construct a query-document bipartite graph from click log data. We then co-cluster the web documents and queries simultaneously using bipartite spectral graph partitioning, which uses the second singular vectors of an appropriately scaled query-document matrix to yield a good bipartition, and rank the queries and documents on the bipartite graph via an iterative process similar to HITS. The experimental results show promising improvement.
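
The two steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the scaling follows the standard bipartite spectral partitioning recipe (divide each entry by the square roots of its row and column sums), the split uses a simple sign threshold in place of a clustering step, and the ranking is a generic HITS-style mutual reinforcement loop. The function names, the `clicks` matrix, and the iteration count are all hypothetical, and the matrix is assumed to have no all-zero rows or columns.

```python
import numpy as np

def spectral_bipartition(A):
    """Split queries (rows) and documents (columns) into two co-clusters.

    A is a query-by-document click-count matrix (assumed: no empty rows/columns).
    """
    A = np.asarray(A, dtype=float)
    d1 = A.sum(axis=1)                      # query degrees (row sums)
    d2 = A.sum(axis=0)                      # document degrees (column sums)
    An = A / np.sqrt(np.outer(d1, d2))      # scaled matrix D1^(-1/2) A D2^(-1/2)
    U, s, Vt = np.linalg.svd(An, full_matrices=False)
    # Second left/right singular vectors, rescaled back by the degrees.
    z_q = U[:, 1] / np.sqrt(d1)
    z_d = Vt[1, :] / np.sqrt(d2)
    # Simplification (assumption): threshold at zero instead of running k-means.
    return (z_q > 0).astype(int), (z_d > 0).astype(int)

def hits_style_rank(A, iters=50):
    """HITS-like scores: queries and documents reinforce each other via clicks."""
    A = np.asarray(A, dtype=float)
    q = np.ones(A.shape[0])
    d = np.ones(A.shape[1])
    for _ in range(iters):
        d = A.T @ q                         # a document is scored by its queries
        d /= np.linalg.norm(d)
        q = A @ d                           # a query is scored by its documents
        q /= np.linalg.norm(q)
    return q, d

# Hypothetical click log: 4 queries x 4 documents, two loosely linked groups.
clicks = np.array([[5, 4, 0, 0],
                   [4, 5, 1, 0],
                   [0, 1, 5, 4],
                   [0, 0, 4, 5]])
query_labels, doc_labels = spectral_bipartition(clicks)
query_scores, doc_scores = hits_style_rank(clicks)
```

To order pages within a cluster, as the abstract describes, the scores can be sorted after filtering by label, for example `np.argsort(-doc_scores[doc_labels == 0])`.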

Journal: INTERNATIONAL JOURNAL ON Advances in Information Sciences and Service Sciences
Publication Date: Sep 30, 2011
Citations: 12

Similar Papers

Analyzing phonetic variation in the traditional English dialects: Simultaneously clustering dialects and phonetic features
M. Wieling ... J. Nerbonne
Literary and Linguistic Computing | VOL. 28
04 Jan 2013

Web Image Clustering with Reduced Keywords and Weighted Bipartite Spectral Graph Partitioning
Su Ming Koh ... Liang-Tien Chia
01 Jan 2006

Co-clustering documents and words using bipartite spectral graph partitioning
Inderjit S Dhillon
26 Aug 2001

Bipartite spectral graph partitioning for clustering dialect varieties and detecting their linguistic features
Martijn Wieling ... John Nerbonne
Computer Speech & Language | VOL. 25
21 May 2010
