Abstract

The huge impact caused by the COVID-19 pandemic has made many people express their opinions on Twitter social media. There are various responses given by the community that are negative and positive. The dataset comes from kaggle with more than 750 tweets of data. Classification designed by the Naive Bayes method. Implementation through preprocessing, case folding, tokenizing, stopword removal, TF-IDF, and cross validation has been able to produce quite high accuracy. After classification, validation will be carried out with Cross Fold Validation. The best value is on cv5 where accuracy = 0.847, precision = 0.855, recall = 0.83, and f1 score = 0.842.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call