Abstract

as a popular social media in the world and even in Indonesia, Twitter has a variety of popular topics making these topics trending, including the topic of natural disasters that have occurred in Indonesia. The DKI Jakarta flood disaster in early 2020 made a big scene on trending twitter topics. This study aims to classify these tweets into "flooded" and "not flooded" predictions with the tweets and geospatial features. The model proposed for classifying is BERT-MLP. Bidirectional Encoder from Transformers (BERT) is used in the pre-trained model to classify these tweets and Multi Layer Perceptron (MLP) is used to classify geospatial features. The scenario designed for the model focuses on the preprocessing of tweets as follows without stopword removal, without stemming, with both, and without both. Once classified, the tweet will be visualized into a two-dimensional interactive map. The best scenario results have an accuracy of 82% in scenarios without stemming and with stopword removal. This is due to the stemming process eliminates some of the features in tweets around 6%. This study also shows the relationship between the influence of negative context tweets on the "not flooded" class with an orientation of 65% of the total data. However, defining manual stopwords can affect because stopword removal will not delete words that still have context related features to the topic.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.