Abstract

Opinion mining or sentiment analysis is a part of text mining and widespread topic nowadays. Opinion mining is the process of understanding, extracting, and processing textual data automatically to get sentiment information cointained in a sentence. One of the opinion mining method that can be used to analyzed text documents is classification. This research aims to classify Indonesian news into three classes of positive, negative, and neutral using Multinomial Naive Bayes. To get optimal result, the author tries to add some feature selections using Document Frequency Thresholding (DF-Thresholding) and Term Weighting using Term Frequency-Inverse Document Frequency (TF-IDF). The result showed that the classification using Multinomial Naive Bayes obtained the highest accuracy with an average 92.44%, Multinomial Naive Bayes with DF-Thresholding had an accuracy of 83,44%, and using Multinomial Naive Bayes with Term Frequency-Inverse Document Frequency (TF-IDF) get an accuracy 78,33%. The actual purpose of using the feature selection in this research to add accuracy value, but the result show less influence in terms of accuracy. Using the selection feature can reduce the use of term dimension.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call