Abstract

Sub-event detection for social media is getting increasingly important because social media platforms have played key roles in disseminating vital information to the public. However, conventional methods fail to achieve high-quality analysis of events evolution due to the severe sparseness and noise of tweets data. In this paper, we propose an unsupervised sub-event detection model which learns rich information from hashtags with a Text-CNN model. Furthermore, a two steps training method leveraged by KL divergence is introduced to further reduce the negative influence of incoherent semantics. The experiments show that our method achieves very good performance on datasets of different languages. On the Chinese dataset, in terms of NMI, and BCubed F1 precision, our method has a significant increase of 5.1%, and 11.9%, respectively, over the baseline methods. In most cases, our method significantly improves the performance of sub-event detection compared with state-of-the-art methods.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call