Abstract

Since social media started getting more attention from users on the Internet, social media has been one of the most important information source in the world. Especially, with the increasing popularity of social media, data posted on social media sites are rapidly becoming collective intelligence, which is a term used to refer to new media that is displacing traditional media. In this paper, we focus on geotagged tweets on the Twitter site. These geotagged tweets are referred to as georeferenced documents because they include not only a short text message, but also the documents’ posting time and location. Many researchers have been tackling the development of new data mining techniques for georeferenced documents to identify and analyze emergency topics, such as natural disasters, weather, diseases, and other incidents. In particular, the utilization of geotagged tweets to identify and analyze natural disasters has received much attention from administrative agencies recently because some case studies have achieved compelling results. In this paper, we propose a novel real-time analysis application for identifying bursty local areas related to emergency topics. The aim of our new application is to provide new platforms that can identify and analyze the localities of emergency topics. The proposed application is composed of three core computational intelligence techniques: the Naive Bayes classifier technique, the spatiotemporal clustering technique, and the burst detection technique. Moreover, we have implemented two types of application interface: a Web application interface and an android application interface. To evaluate the proposed application, we have implemented a real-time weather observation system embedded the proposed application. we used actual crawling geotagged tweets posted on the Twitter site. The weather observation system successfully detected bursty local areas related to observed emergency weather topics.Electronic supplementary materialThe online version of this article (doi:10.1186/s40064-015-0817-x) contains supplementary material, which is available to authorized users.Electronic supplementary materialThe online version of this article (doi:10.1186/s40064-015-0817-x) contains supplementary material, which is available to authorized users.

Highlights

  • In recent years, social media has played a significant role as an alternative source of information (Kavanaugh et al 2011; Yin et al 2012)

  • Tamura and Ichimura (2013) proposed a novel density-based spatiotemporal clustering algorithm that can extract spatially and temporally separated spatial clusters in georeferenced documents. They proposed the (, τ )-density-based spatiotemporal clustering algorithm; the experimental results indicate that their proposed algorithm can extract bursty local areas in a set of geotagged tweets, including keywords related to a topic related to weather topics

  • We propose a novel real-time analysis application for identifying bursty local areas related to emergency topics

Read more

Summary

Introduction

Social media has played a significant role as an alternative source of information (Kavanaugh et al 2011; Yin et al 2012). Tamura and Ichimura (2013) proposed a novel density-based spatiotemporal clustering algorithm that can extract spatially and temporally separated spatial clusters in georeferenced documents. They proposed the ( , τ )-density-based spatiotemporal clustering algorithm; the experimental results indicate that their proposed algorithm can extract bursty local areas in a set of geotagged tweets, including keywords related to a topic related to weather topics.

Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.