Abstract

Micro blogging is fast becoming a dominant medium in social media and its impact is evident in our daily lives. A massive amount of information is produced on a daily basis. It is observed that detecting hot topics can be very helpful for people to get essential information quickly. But due to short and sparse features, high flood of meaningless tweets and other characteristics of micro blogs, traditional topic detection methods are unable to achieve a desirable level of performance. In this paper, we propose a multi-attribute latent dirichlet allocation (MA-LDA) model, a topic analysis model in which the time and tag attributes of micro blogs are incorporated into LDA model. By introducing a time variable about the time attribute, MA-LDA model can decide whether a word should appear in hot topics or not. Applying tag attribute allows MA-LDA model to rank the core words high in results so that the expressiveness of outcomes can be improved over the traditional LDA model. Empirical evaluation on real data sets demonstrate our method is able to detect hot topics accurately and efficiently with more terms associated with each hot topic found. Our study provides strong evidence of the importance of the temporal factor in hot topics extraction.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.