A graph convolutional topic model for short and noisy text streams

Ngo Van Linh,Tran Xuan Bach,Khoat Than

doi:10.1016/j.neucom.2021.10.047

Abstract

Learning hidden topics from data streams has become absolutely necessary but posed challenging problems such as concept drift as well as short and noisy data. Using prior knowledge to enrich a topic model is one of potential solutions to cope with these challenges. Prior knowledge that is derived from human knowledge (e.g. Wordnet) or a pre-trained model (e.g. Word2vec) is very valuable and useful to help topic models work better. However, in a streaming environment where data arrives continually and infinitely, existing studies are limited to exploiting these resources effectively. Especially, a knowledge graph, that contains meaningful word relations, is ignored. In this paper, to aim at exploiting a knowledge graph effectively, we propose a novel graph convolutional topic model (GCTM) which integrates graph convolutional networks (GCN) into a topic model and a learning method which learns the networks and the topic model simultaneously for data streams. In each minibatch, our method not only can exploit an external knowledge graph but also can balance the external and old knowledge to perform well on new data. We conduct extensive experiments to evaluate our method with both a human knowledge graph (Wordnet) and a graph built from pre-trained word embeddings (Word2vec). The experimental results show that our method achieves significantly better performances than state-of-the-art baselines in terms of probabilistic predictive measure and topic coherence. In particular, our method can work well when dealing with short texts as well as concept drift.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A graph convolutional topic model for short and noisy text streams

Abstract

Talk to us

Similar Papers

More From: Neurocomputing

Lead the way for us

Journal: Neurocomputing	Publication Date: Oct 21, 2021
Citations: 13

Similar Papers

Investigating the Efficient Use of Word Embedding with Neural-Topic Models for Interpretable Topics from Short Texts.
Riki Murakami ... Basabi Chakraborty
Sensors | VOL. 22
Riki Murakami, et. al.Riki Murakami ... Basabi Chakraborty
23 Jan 2022
Sensors | VOL. 22

Use of Neural Topic Models in conjunction with Word Embeddings to extract meaningful topics from short texts
Nassera Habbat ... Houda Anoun
EAI Endorsed Transactions on Internet of Things | VOL. 8
Nassera Habbat, et. al.Nassera Habbat ... Houda Anoun
30 Sep 2022
EAI Endorsed Transactions on Internet of Things | VOL. 8

A Drift-Sensitive Distributed LSTM Method for Short Text Stream Classification
Peipei Li ... Yang Hu
IEEE Transactions on Big Data | VOL. 9
Peipei Li, et. al.Peipei Li ... Yang Hu
01 Feb 2023
IEEE Transactions on Big Data | VOL. 9

Online Biterm Topic Model based short text stream classification using short text expansion and concept drifting detection
Xuegang Hu ... Peipei Li
Pattern Recognition Letters | VOL. 116
Xuegang Hu, et. al.Xuegang Hu ... Peipei Li
16 Oct 2018
Pattern Recognition Letters | VOL. 116

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A graph convolutional topic model for short and noisy text streams

Abstract

Talk to us

Similar Papers

More From: Neurocomputing