Dynamic topic modeling via self-aggregation for short text streams

Lei Shi,Meiyu Liang,Junping Du,Feifei Kou

doi:10.1007/s12083-018-0692-7

Abstract

Social networks such as Twitter, Facebook, and Sina microblogs have emerged as major sources for discovering and sharing the latest topics. Because social network topics change quickly, developing an effective method to model such topics is urgently needed. However, topic modeling is challenging due to the sparsity problem and the dynamic change of topics in microblog streams. In this study, we propose dynamic topic modeling via a self-aggregation method (SADTM) that can capture the time-varying aspect of topic distributions and resolve the sparsity problem. The SADTM aggregates the observable and unordered short texts as the aggregated document without leveraging an external context to overcome the sparsity problem of short text. Furthermore, we exploit word pairs instead of words for each microblog to generate more word co-occurrence patterns. The SADTM models temporal dynamics by using the topic distribution at previous time steps in microblog streams to infer the current topic from sequential data. Extensive experiments on a real-world Sina microblog dataset demonstrate that our SADTM algorithm outperforms several state-of-the-art methods in topic coherence and cluster quality. Additionally, when applied in a search scene, our SADTM significantly outperforms all baseline methods in terms of the quality of the search results.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Dynamic topic modeling via self-aggregation for short text streams

Abstract

Talk to us

Similar Papers

More From: Peer-to-Peer Networking and Applications

Lead the way for us

Journal: Peer-to-Peer Networking and Applications	Publication Date: Nov 14, 2018
Citations: 17

Similar Papers

GPU-BTM: A Topic Model for Short Text using Auxiliary Information
Yibing Guo ... Yutao Huang
-
Yibing Guo, et. al.Yibing Guo ... Yutao Huang
01 Jul 2020
01 Jul 2020

Fuzzy topic modeling approach for text mining over short text
Junaid Rashid ... Aun Irtaza
Information Processing & Management | VOL. 56
Junaid Rashid, et. al.Junaid Rashid ... Aun Irtaza
21 Jun 2019
Information Processing & Management | VOL. 56

A biterm topic model for short texts
Xiaohui Yan ... Jiafeng Guo
-
Xiaohui Yan, et. al.Xiaohui Yan ... Jiafeng Guo
13 May 2013
13 May 2013

TSSE-DMM: Topic Modeling for Short Texts Based on Topic Subdivision and Semantic Enhancement
Chengcheng Mai ... Bo Zhao
-
Chengcheng Mai, et. al.Chengcheng Mai ... Bo Zhao
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Dynamic topic modeling via self-aggregation for short text streams

Abstract

Talk to us

Similar Papers

More From: Peer-to-Peer Networking and Applications