Abstract

In social media, micro-blogging is a common routine of most people around the world. Social media analytics gathering structure and unstructured Big-Data from various social sites and analyzing to make business decisions using Apache Hadoop and Apache Hive respectively. The objective of the research work was to develop big data technology used for gathering and handling large unstructured data from real time social media for sentiment analysis for identifying the brand and services. The methodology devised an algorithm based on sentimental analysis using customers review classification, which dealt with prepossessing the datasets, clustering of the data based on the specific domains, feature vector using n-gram models and tf-idf vectors extracts synonyms and classification sentiment analysis. The result shows that applied sentimental analysis with unsupervised clustering of data into specific domains and supervised machine learning techniques handle large amounts of twitter data in an efficient way. The developed tool 1.5 times faster than that of traditional database to Hadoop cluster and also the accuracy is nearly 80 %, which helps the user in computing, analyzing and interpreting interaction and associations between people, topics and ideas.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call