Abstract

Sentiment classification determines the polarity of product or user reviews. Supervised machine learning algorithms such as naive Bayes, K-nearest neighbour (KNN), decision trees, maximum entropy, hidden Markov models and support vector machines are used for opinion mining. KNN is a simple algorithm, but a less efficient classifier. In this paper, we propose an improved KNN algorithm: an optimised feature selection step, in which a genetic algorithm incorporates information gain, is combined with the bagging technique and KNN to improve the accuracy of sentiment classification. Specifically, we compare two approaches with traditional KNN for sentiment classification of movie reviews and product reviews. The same approach is applied to other machine learning algorithms, such as support vector machine and naive Bayes, and the results are compared with a POS-based feature set method. Experimental results indicate that the proposed method, using information gain and a genetic algorithm with the bagging technique, achieves higher performance, with an accuracy of 87.50% on the movie reviews, and performs better in terms of accuracy, precision and recall for movie, DVD, electronics and kitchen reviews.
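As a rough illustration of the pipeline described above, the following minimal sketch ranks terms by information gain and then classifies a review with a bagged KNN vote. It is not the authors' implementation: the genetic algorithm search is omitted, and the bag-of-words set representation, the Hamming distance, the toy reviews and all function names are assumptions made for illustration only.

```python
import math
import random
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def information_gain(docs, labels, term):
    """Entropy reduction obtained by splitting documents on presence of `term`."""
    with_term = [y for d, y in zip(docs, labels) if term in d]
    without = [y for d, y in zip(docs, labels) if term not in d]
    n = len(labels)
    # Weighted entropy of the two partitions (empty partitions contribute nothing).
    cond = sum(len(p) / n * entropy(p) for p in (with_term, without) if p)
    return entropy(labels) - cond

def select_features(docs, labels, k):
    """Keep the k terms with the highest information gain."""
    vocab = sorted(set().union(*docs))
    ranked = sorted(vocab, key=lambda t: information_gain(docs, labels, t), reverse=True)
    return ranked[:k]

def knn_predict(train, query, k, features):
    """Majority vote of the k nearest neighbours (Hamming distance, assumed here)."""
    def dist(doc):
        return sum((f in doc) != (f in query) for f in features)
    nearest = sorted(train, key=lambda pair: dist(pair[0]))[:k]
    return Counter(y for _, y in nearest).most_common(1)[0][0]

def bagged_knn_predict(docs, labels, query, features, k=3, n_bags=11, seed=0):
    """Bagging: train KNN on bootstrap resamples and take a majority vote."""
    rng = random.Random(seed)
    data = list(zip(docs, labels))
    votes = Counter()
    for _ in range(n_bags):
        sample = [rng.choice(data) for _ in data]  # sample with replacement
        votes[knn_predict(sample, query, k, features)] += 1
    return votes.most_common(1)[0][0]

# Hypothetical toy reviews, each represented as a set of tokens.
docs = [{"great", "fun", "plot"}, {"great", "acting"}, {"fun", "story"},
        {"boring", "plot"}, {"boring", "acting"}, {"dull", "story"}]
labels = ["pos", "pos", "pos", "neg", "neg", "neg"]

features = select_features(docs, labels, 3)
query = {"great", "fun", "film"}
print(features)
print(bagged_knn_predict(docs, labels, query, features))
```

In this sketch the information gain ranking stands in for the whole feature selection stage; a genetic algorithm would instead search over candidate feature subsets, using a score such as information gain or classification accuracy as its fitness function.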
