Abstract

Probabilistic topic models are an effective framework for text analysis that uncovers the main topics in an unlabeled set of documents. However, the topics inferred by traditional topic models are often unclear and hard to interpret because these models do not account for semantic structures in language. Recently, a number of topic modeling approaches have leveraged domain knowledge to enhance the quality of the learned topics, but they still assume a multinomial or Gaussian document likelihood in Euclidean space, which often results in information loss and poor performance. In this article, we propose a Bayesian embedded spherical topic model (ESTM) that combines both a knowledge graph and word embeddings in a non-Euclidean curved space, the hypersphere, for better topic interpretability and more discriminative text representations. Extensive experimental results show that our proposed model uncovers interpretable topics and learns high-quality text representations useful for common natural language processing (NLP) tasks across multiple benchmark datasets.
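To make the spherical modeling idea concrete: the abstract does not specify the document likelihood ESTM uses, but spherical topic models commonly place a von Mises-Fisher (vMF) distribution over L2-normalized embeddings on the unit hypersphere, in place of a multinomial or Gaussian in Euclidean space. The sketch below is purely illustrative of that general idea, not the paper's actual model; the function name `vmf_log_pdf` and the concentration value `kappa=20.0` are assumptions chosen for the example.

```python
import numpy as np
from scipy.special import ive  # exponentially scaled modified Bessel function


def vmf_log_pdf(x, mu, kappa):
    """Log-density of a von Mises-Fisher distribution on the unit
    hypersphere S^{p-1}, evaluated at unit vector x with mean
    direction mu and concentration kappa > 0."""
    p = x.shape[-1]
    # log C_p(kappa): use ive(v, k) = iv(v, k) * exp(-k) for stability,
    # so log iv(v, k) = log ive(v, k) + k.
    log_c = ((p / 2 - 1) * np.log(kappa)
             - (p / 2) * np.log(2 * np.pi)
             - (np.log(ive(p / 2 - 1, kappa)) + kappa))
    return log_c + kappa * np.dot(mu, x)


# Toy usage: score a "document" embedding against two topic directions.
rng = np.random.default_rng(0)
doc = rng.normal(size=50)
doc /= np.linalg.norm(doc)                 # project onto the unit sphere
topics = rng.normal(size=(2, 50))
topics /= np.linalg.norm(topics, axis=1, keepdims=True)
print([vmf_log_pdf(doc, t, kappa=20.0) for t in topics])
```

Because the vMF density depends on the data only through the dot product `mu · x`, likelihood under such a model tracks cosine similarity, which is a natural fit for normalized word and knowledge-graph embeddings.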
