A novel classification approach based on Naïve Bayes for Twitter sentiment analysis

Kyung-Tae Kim ,Sang-Young Kim ,Hee Yong Youn ,Byung Seok Lee ,Junseok Song

doi:10.3837/tiis.2017.06.011

Abstract

With rapid growth of web technology and dissemination of smart devices, social networking service(SNS) is widely used. As a result, huge amount of data are generated from SNS such as Twitter, and sentiment analysis of SNS data is very important for various applications and services. In the existing sentiment analysis based on the Naive Bayes algorithm, a same number of attributes is usually employed to estimate the weight of each class. Moreover, uncountable and meaningless attributes are included. This results in decreased accuracy of sentiment analysis. In this paper two methods are proposed to resolve these issues, which reflect the difference of the number of positive words and negative words in calculating the weights, and eliminate insignificant words in the feature selection step using Multinomial Naive Bayes(MNB) algorithm. Performance comparison demonstrates that the proposed scheme significantly increases the accuracy compared to the existing Multivariate Bernoulli Naive Bayes(BNB) algorithm and MNB scheme.

Full Text