Abstract

Citation recommendation systems mainly help researchers find the lists of references that related to their interests effectively and automatically. The existing approaches face the issues of data sparsity and high-dimensional in large-scale bibliographic network representation, which hinder the citation recommendation performance. To address these problems, we proposed a Content-Sensitive citation representation approach for Citation Recommendation, named CSCR. Firstly, the Doc2vec model is used to generate a paper embedding according to paper content. Then, utilizing the similarity between the paper content embeddings to select the assumed neighbours of the target paper, append the auxiliary links between target paper and its new neighbours in the bibliographic network. Thirdly, distributed network representation method is implemented on appended bibliographic network to obtain the paper node embedding, which can learn interpretable lower dimension embedding for paper nodes. Finally, the embedding vectors of these papers can be used to conduct citation recommendation. Experimental results show that the proposed approach significantly outperforms other benchmark methods in Normalized Discounted Cumulative Gain (NDCG) and the positive rate (Recall).

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call