Abstract

Sentiment analysis is an eminent part of data mining for the investigation of user perception. Twitter is one of the popular social platforms for expressing thoughts in the form of tweets. Nowadays, tweets are widely used for analyzing the sentiments of the users, and utilized for decision making purposes. Though clustering and classification methods are used for the twitter sentiment analysis, meta-heuristic based clustering methods has witnessed better performance due to subjective nature of tweets. However, sequential meta-heuristic based clustering methods are computation intensive for large scale datasets. Therefore, in this paper, a novel MapReduce based K-means biogeography based optimizer(MR-KBBO) is proposed to leverage the strength of biogeography based optimizer with MapReduce model to efficiently cluster the large scale data. The proposed method is validated against four state-of-the-art MapReduce based clustering methods namely; parallel K-means, parallel K-means particle swarm optimization, MapReduce based artificial bee colony optimization, dynamic frequency based parallel k-bat algorithm on four large scale twitter datasets. Further, speedup measure is used to illustrate the computation performance on varying number of nodes. Experimental results demonstrate that the proposed method is efficient in sentiment mining for the large scale twitter datasets.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call