Abstract

With the development of scientific research, scientific publications are valuable resources for new-comers in the research field. But massive scientific publications make it a challenge for researchers diving into a new research field. As a good practice to this problem, topics are put forward to organize publications. In this paper, we propose two modified LDA topic models as solutions to topic analysis and influential paper discovery on scientific publications, cc-LDA and cp-LDA. Compared to state-of-the-art researches on LDA, we incorporate citation information including its occurrence times and occurrence position into our models. Model cc-LDA integrates paper content and citation occurrence into LDA model, while cp-LDA considers both occurrence and position of citations. Both models can not only find topics in the form of citation distribution, but also help discover influential papers under certain topics. Furthermore, both models can extract more representative vectors for papers, which achieve good performance in subsequent clustering.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.