Performance evaluation of Latent Dirichlet Allocation in text mining

Zelong Liu,Mahesh Ponraj,Yang Liu,Maozhen Li

doi:10.1109/fskd.2011.6020066

Performance evaluation of Latent Dirichlet Allocation in text mining

Zelong Liu, Mahesh Ponraj + Show 2 more

https://doi.org/10.1109/fskd.2011.6020066

Copy DOI

Publication Date: Jul 1, 2011

Citations: 26

Affiliation: Brunel University London

#Latent Dirichlet Allocation Model #Support Vector Machine + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

This paper introduces three classic models of statistical topic models: Latent Semantic Indexing (LSI), Probabilistic Latent Semantic Indexing (PLSI) and Latent Dirichlet Allocation (LDA). Then a method of text classification based on LDA model is briefly described, which uses LDA model as a text representation method. Each document means a probability distribution of fixed latent topic sets. Next, Support Vector Machine (SVM) is chose as classification algorithm. Finally, the evaluation parameters in classification system of LDA with SVM are higher than other two methods which are LSI with SVM and VSM with SVM, showing a better classification performance.

Full Text