Abstract

<p align="justify">Sentiment analysis on unbalanced data will cause classification errors where the classification results tend to be in the majority class. Therefore, it is necessary to handle unbalanced data. In this study, a combination of synthetic minority oversampling technique (SMOTE) and Tomek link methods will be used to handle unbalanced data. In this study, we use the Recurrent Neural Network (RNN) method to analyze the sentiment of Shopee application users based on review data. Shopee Indonesia application review data shows that around 80% of Shopee application users have positive sentiments and 20% have negative sentiments, which means the data is not balance. In this study, preprocessing process with combination of synthetic minority oversampling technique (SMOTE) and Tomek link method used to handle the condition. The performance of the result is quite good, namely 80% accuracy, 84.1% precision, 92.5% sensitivity, 30% specificity, and 88.1% F1-score. It is better than performance of sentiment analysis that without preprocessing to handle imbalanced data.</p><p><strong>Keywords</strong><strong>: </strong>sentiment analysis; imbalanced data; Tomek link; SMOTE; RNN</p>

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call