Exploring Symmetrical and Asymmetrical Dirichlet Priors for Latent Dirichlet Allocation

Shaheen Syed, Marco Spruit

doi:10.1142/s1793351x18400184

Abstract

Latent Dirichlet Allocation (LDA) has gained much attention from researchers and is increasingly being applied to uncover underlying semantic structures from a variety of corpora. However, nearly all researchers use symmetrical Dirichlet priors, often unaware of the practical implications that they bear. This research is the first to explore the effect of symmetrical and asymmetrical Dirichlet priors on topic coherence and human topic ranking when uncovering latent semantic structures from scientific research articles. More specifically, we examine the practical effects of several classes of Dirichlet priors on 2,000 LDA models created from abstract and full-text research articles. Our results show that symmetrical or asymmetrical priors on the document–topic distribution or the topic–word distribution for full-text data have little effect on topic coherence scores and human topic ranking. In contrast, asymmetrical priors on the document–topic distribution for abstract data show a significant increase in topic coherence scores and improved human topic ranking compared to a symmetrical prior. Symmetrical or asymmetrical priors on the topic–word distribution show no real benefits for either abstract or full-text data.
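
To make the distinction between the two kinds of prior concrete, the following is a minimal sketch, not the authors' experimental setup. It draws document–topic proportions from a symmetrical Dirichlet prior (every topic gets the same concentration value) and from an asymmetrical one (topics get different values). The function names, the number of topics, the concentration values, and the 1/(k+1) weighting used for the asymmetrical prior are all assumptions chosen for illustration; the abstract does not specify the prior classes that were tested.

```python
import random


def sample_dirichlet(alpha, rng):
    """Draw one sample from Dirichlet(alpha) by normalizing Gamma draws."""
    draws = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [d / total for d in draws]


def symmetric_prior(num_topics, value):
    """Symmetrical prior: the same concentration value for every topic."""
    return [value] * num_topics


def asymmetric_prior(num_topics, total_mass):
    """Asymmetrical prior (illustrative assumption): weights proportional
    to 1/(k+1), rescaled so the values sum to total_mass."""
    weights = [1.0 / (k + 1) for k in range(num_topics)]
    scale = total_mass / sum(weights)
    return [w * scale for w in weights]


def mean_topic_proportions(alpha, num_docs, seed=0):
    """Average the sampled document-topic proportions over num_docs documents."""
    rng = random.Random(seed)
    sums = [0.0] * len(alpha)
    for _ in range(num_docs):
        theta = sample_dirichlet(alpha, rng)
        for k, p in enumerate(theta):
            sums[k] += p
    return [s / num_docs for s in sums]


if __name__ == "__main__":
    num_topics = 5  # hypothetical value, for illustration only
    sym = symmetric_prior(num_topics, 0.5)
    asym = asymmetric_prior(num_topics, total_mass=2.5)
    print("symmetric prior: ", [round(a, 3) for a in sym])
    print("asymmetric prior:", [round(a, 3) for a in asym])
    print("mean proportions (symmetric): ",
          [round(m, 3) for m in mean_topic_proportions(sym, 2000)])
    print("mean proportions (asymmetric):",
          [round(m, 3) for m in mean_topic_proportions(asym, 2000)])
```

Under the symmetrical prior, no topic is favored a priori, so the average proportions come out roughly equal. Under the asymmetrical prior, topics with larger concentration values are expected to take a larger share of each document on average, which is the property the document–topic results in the abstract concern.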


Journal: International Journal of Semantic Computing
Publication Date: Sep 1, 2018
Citations: 12

Similar Papers

Selecting Priors for Latent Dirichlet Allocation
Shaheen Syed ... Marco Spruit
01 Jan 2018

Full-Text or Abstract? Examining Topic Coherence Scores Using Latent Dirichlet Allocation
Shaheen Syed ... Marco Spruit
01 Oct 2017

Web content topic modeling using LDA and HTML tags
Hamza H.M Altarturi ... Muntadher Saadoon
PeerJ Computer Science | Vol. 9
11 Jul 2023

A data-driven analysis to determine the optimal number of topics 'K' for latent Dirichlet allocation model
Astha Goyal ... Indu Kashyap
Indonesian Journal of Electrical Engineering and Computer Science | Vol. 35
01 Jul 2024
