Abstract

Twitter sentiment analysis has become an effective way in measuring public sentiment about a certain topic or product. Thus, researchers have worked extensively in recent years to build efficient models for sentiment classification. In this paper, we will measure the effect of varying the training set size on the classification accuracy and F-score of SVM and Naive Bayes classifiers. We will expand our study even further by forming two ensembles: Ensemble 1 and Ensemble 2. Both ensembles include a single Naive Bayes and SVM classifier, but the ensembles differ in terms of the decision fusion technique utilized. Ensemble 1 uses ‘AND-type’ fusion while Ensemble 2 uses ‘OR-type’ fusion. In this paper, we measure the effect of training set size on each ensemble configuration type by measuring their F-scores and classification accuracies while varying the training set size.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call