Abstract

Topic models have been widely used to infer latent topics in text documents. However, unsupervised topic models often produce incoherent topics, which confuses users in real applications. Incorporating prior domain knowledge into topic models is an effective strategy for extracting coherent and meaningful topics. In this paper, we go one step further and explore how different forms of prior semantic word relations can be encoded into models to improve the topic modeling process. We develop a novel topic model, called Mixed Word Correlation Knowledge-based Latent Dirichlet Allocation, to infer latent topics from a text corpus. Specifically, the proposed model mines two forms of lexical semantic knowledge based on recent progress in word embeddings, which represent the semantic information of words in a continuous vector space. To incorporate the generated prior knowledge, a Mixed Markov Random Field is constructed over the latent topic layer to regularize the topic assignment of each word during the sampling process. Experimental results on two public benchmark datasets demonstrate the superior performance of the proposed approach over several state-of-the-art baseline models.
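The abstract does not spell out the two forms of lexical knowledge it mines. As one plausible reading, the sketch below shows in Python how correlation knowledge might be derived from pretrained word embeddings, assuming cosine-similarity thresholds separate positively correlated (must-link) from negatively correlated (cannot-link) word pairs. The function name and thresholds (`mine_word_correlations`, `pos_threshold`, `neg_threshold`) are illustrative assumptions, not details from the paper.

```python
import numpy as np

def mine_word_correlations(embeddings, vocab, pos_threshold=0.7, neg_threshold=0.0):
    """Derive two forms of lexical knowledge from word embeddings:
    positively correlated (must-link) and negatively correlated
    (cannot-link) word pairs, based on cosine similarity.

    embeddings: (V, D) array of word vectors, row i for vocab[i].
    Thresholds are illustrative hyperparameters, not values from the paper.
    """
    # L2-normalize rows so a dot product equals cosine similarity.
    norms = np.linalg.norm(embeddings, axis=1, keepdims=True)
    unit = embeddings / np.clip(norms, 1e-12, None)
    sim = unit @ unit.T  # (V, V) cosine similarity matrix

    must_link, cannot_link = [], []
    for i in range(len(vocab)):
        for j in range(i + 1, len(vocab)):
            if sim[i, j] >= pos_threshold:
                must_link.append((vocab[i], vocab[j]))    # likely same topic
            elif sim[i, j] <= neg_threshold:
                cannot_link.append((vocab[i], vocab[j]))  # likely different topics
    return must_link, cannot_link

# Toy usage with random vectors standing in for trained embeddings.
if __name__ == "__main__":
    rng = np.random.default_rng(0)
    vocab = ["game", "team", "player", "election", "vote"]
    emb = rng.normal(size=(len(vocab), 50))
    ml, cl = mine_word_correlations(emb, vocab)
    print("must-link:", ml)
    print("cannot-link:", cl)
```

In the full model, such pairs would then parameterize the potentials of the Mixed Markov Random Field over the latent topic layer, nudging must-link words in a document toward shared topic assignments and cannot-link words toward different ones during sampling.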
