Abstract
AbstractThis paper presents empirical evaluation of three most important topic modeling techniques—latent semantic analysis, latent Dirichlet allocation and correlated topic modeling, The novelty of work exists in application or domain-independent comparative evaluation of techniques, that is not presented before in the literature. Most of the work on topic modeling is very application specific, so it becomes difficult to conclude a generalization solution. In this research, a detailed comparative evaluation of topic modeling techniques is presented, and it is found that latent Dirichlet allocation technique is the true synonym for topic modeling. So latent Dirichlet allocation is a truly unsupervised kind of machine learning technique, correlated topic modeling is another Dirichlet free probabilistic topic modelling to find correlation among topics. Latent semantic analysis presents semantic gist of corpus at the diagonal in such an effective way, that it can be considered as a very promising technique in natural language processing and text mining.KeywordsLatent semantic analysisLatent Dirichlet allocationBag of Words (BoW)Dimension reduction
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.