CPB: a classification-based approach for burst time prediction in cascades

Senzhang Wang,Biao Wang,Zhao Yan,Zhoujun Li,Philip S Yu,Xia Hu

doi:10.1007/s10115-015-0899-3

Abstract

Studying the bursty nature of cascades in social media is practically important in many real applications such as product sales prediction, disaster relief, and stock market prediction. Although both the cascade size prediction and the burst patterns of the cascades have been extensively studied, how to predict when a burst will come remains an open problem. It is challenging for traditional time-series-based models such as regression models to address this task directly. Firstly, times-series-based prediction models focus on predicting the future values based on previously observed ones. It is hard to apply them to predict the time of a bursts with the "quick rise-and-fall" pattern. Secondly, besides the cascade popularity, a lot of other side information like user profile and social relation are available in social media. Although the potential utility of such information can be high, it is also hard for time-series-based models to capture and integrate these rich information with diverse formats seamlessly. This paper proposes a classification-based approach for burst time prediction by exploiting rich knowledge in information diffusion. Particularly, we first propose a time-window-based transformation to predict in which time window the burst will appear. By dividing the time spans of all the cascades into the same number of time windows K, the cascades with diverse time spans can thus be handled uniformly. To exploit the rich and heterogenous information in social media, we next propose a scale-independent feature extraction framework to model the heterogenous knowledge in a scale-independent manner. Systematical evaluations are conducted on the Sina Weibo reposting dataset and MemeTracker dataset. Besides the superior performance of the proposed approach, we also observe that: (1) surprisingly, social/structure knowledge is more indicative of the bursts than the cascade popularity information, especially for the bursts occurring in a farther future. (2) Larger cascades are harder to predict as the spreading process of the cascades with higher popularity is usually more diverse and fluctuant. (3) The proposed approach is robust in the sense that the result is not much sensitive to the popularity of the training cascades.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

CPB: a classification-based approach for burst time prediction in cascades

Abstract

Talk to us

Similar Papers

More From: Knowledge and Information Systems

Lead the way for us

Journal: Knowledge and Information Systems	Publication Date: Dec 9, 2015
Citations: 42

Similar Papers

Burst Time Prediction in Cascades
Senzhang Wang ... Zhao Yan
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 29
Senzhang Wang, et. al.Senzhang Wang ... Zhao Yan
09 Feb 2015
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 29

Privacy Data Diffusion Modeling and Preserving in Online Social Network
Xiangyu Hu ... Wanlei Zhou
IEEE Transactions on Knowledge and Data Engineering | VOL. -
Xiangyu Hu, et. al.Xiangyu Hu ... Wanlei Zhou
01 Jan 2021
IEEE Transactions on Knowledge and Data Engineering | VOL. -

Navigating Social Media in #Ophthalmology
Edmund Tsui ... Rajesh C Rao
Ophthalmology | VOL. 126
Edmund Tsui, et. al.Edmund Tsui ... Rajesh C Rao
20 May 2019
Ophthalmology | VOL. 126

Parents' Use of Social Media as a Health Information Source for Their Children: A Scoping Review.
Erika Frey ... Jane Frawley
Academic Pediatrics | VOL. 22
Erika Frey, et. al.Erika Frey ... Jane Frawley
01 May 2022
Academic Pediatrics | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

CPB: a classification-based approach for burst time prediction in cascades

Abstract

Talk to us

Similar Papers

More From: Knowledge and Information Systems