Abstract

The eyewitness message on twitter as a social network sensor aims to determine the classification process's performance. In the classification of flood disaster messages, preprocessing data is required before the classification process is carried out. Preprocessing affects the resulting level of accuracy in the classification process. Stopword removal is part of preprocessing so that the effect of stopword removal on classification performance will be examined. Support Vector Machine (SVM) is used to classify by weighting words using Term Frequency-Inverse Document Frequency (TF-IDF). The data taken from Twitter is 3000 data with 1000 data each for each label. The effect of the stopword on accuracy performance can be seen in several experiments that have been carried out. We had already conducted three different experiments, and the highest level of accuracy was 76.6%, 76.87%, and 77.87%. Based on the experiments that have been carried out, stopword is very influential on the accuracy generated by the classification of flood disaster messages on Twitter.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call