Abstract

The growth in data generated by social media platforms like Twitter provides a wealth of potential information waiting to be extracted (or mined) - traditionally with a price tag. With the recent advancements in Open Source technologies, specifically Big Data, within the Information Technology world, businesses have started to gather as much information as possible about their customers and market space. The Big Data platform, Hadoop, has become extremely proficient at managing social media data ingestion, storage and processing, due to its ability to use both structured and unstructured data. The aim of this study is to demonstrate a Big Data environment running on Open Source technologies, in order to explore the possibilities of performing geo-located sentiment analytics on Twitter data. Subsequent to this, the link between events and changes in population sentiment was investigated. In this study, an average of 47% of the total tweets ingested were geo-locatable to a country. The Open Source Big Data software was able to demonstrate the reliability of the environment, as well as identify possible limitations to having an environment setup like the one used in this study. A number of research sub-questions were answered, one of which provided information suggesting causality between an event and the change in a populations sentiment when focusing on the events specific topic on Twitter. By performing sentiment analytics on the Twitter data, potential influential users were identifiable for each use case, while allowing additional analytics to be performed and so highlight themes and trends within the data. Three use cases will be concisely addressed in this paper. The first is the Oscar Pistorius Trial (legal), the second is the FIFA World Cup of 2014 (sport), and the last one, a movie titled Maze Runner.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.