A density-based method for adaptive LDA model selection

Juan Cao,Tian Xia,Jintao Li,Yongdong Zhang,Sheng Tang

doi:10.1016/j.neucom.2008.06.011

A density-based method for adaptive LDA model selection

Juan Cao, Tian Xia + Show 3 more

https://doi.org/10.1016/j.neucom.2008.06.011

Copy DOI

Journal: Neurocomputing	Publication Date: Aug 28, 2008
Citations: 534

Affiliation: Institute of Computing Technology, Chinese Academy of Sciences, University of Chinese Academy of Sciences

#Topics In Latent Dirichlet Allocation #Appropriate Number Of Topics + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Topic models have been successfully used in information classification and retrieval. These models can capture word correlations in a collection of textual documents with a low-dimensional set of multinomial distribution, called “topics”. However, it is important but difficult to select the appropriate number of topics for a specific dataset. In this paper, we study the inherent connection between the best topic structure and the distances among topics in Latent Dirichlet allocation (LDA), and propose a method of adaptively selecting the best LDA model based on density. Experiments show that the proposed method can achieve performance matching the best of LDA without manually tuning the number of topics.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: Neurocomputing

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.