Abstract

A body of scientific literature can be represented by a directed graph, which is commonly referred to as the citation graph. This paper describes an implementation of the random walk graph clustering algorithm to identify the communities within the citation graph. Based only on the link structure of the citation graph, the random walk graph clustering algorithm is able to efficiently identity highly topically related communities. The ability to identify community structure in the citation graph clearly has practical application. Communities in the citation graph represent related papers on a single topic. This approach is used as a method for finding related papers to a paper of interest by treating member papers in the same community of papers of interest as related papers. The performance of the random walk graph clustering algorithm was evaluated compared to the method for finding related papers used by ResearchIndex, CCIDF. The experimental results show that the random walk graph clustering algorithm performs better than the CCIDF.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call