Abstract

The Twitter micro blogging site is one of the most popular online social media in today's Web. During an important event, such as a disaster event, hundreds of thousands of tweets are posted rapidly on Twitter. The information is posted too fast for anyone to make sense, hence the information needs to be organized in order to utilize the information effectively. It has been observed that many of the tweets posted during an event are very similar to each other, hence clustering or grouping similar tweets is an effective way to reduce the information load. However, clustering of tweets is challenging because of the small size and noisy nature of tweets. In this work, we propose a novel clustering approach for tweets, which combines two different approaches — a traditional clustering approach K-Means, and an evolutionary approach, Genetic Algorithms. We conduct experiments on a dataset of real tweets collected during a recent disaster event, and show that the proposed methodology performs better than several existing clustering techniques.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.