Abstract
User reviews can vary widely in language and writing style, which can make accurate sentiment modeling difficult. Selecting the right machine learning model and comparing performance between models can be challenging, given that each model has its own strengths and weaknesses. The method used involved data collection by scraping 5000 reviews from the Google Play Store, followed by data pre-processing including data cleaning, tokenization, stemming, and feature engineering using TF-IDF. The data was divided into training (70%) and testing (30%) sets, with the SMOTE oversampling technique applied to address class imbalance. Three machine learning models were used: Random Forest, Support Vector Machine (SVM), and Naive Bayes. The results showed that the majority of reviews were positive, with a high average app rating. Word cloud analysis revealed that “service”, “driver”, “price”, and “time” were the most frequently discussed aspects in the reviews. In terms of model performance, SVM performed the best with an accuracy of 91.3%, followed by Random Forest (89%) and Naive Bayes (78%). Maxim was generally well received by users in Indonesia, with the majority of reviews being positive. The SVM model proved to be the most effective in classifying review sentiment, outperforming other models in accuracy and precision.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: International Journal of Quantitative Research and Modeling
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.