Abstract

Social media is an online media that functions as a platform for users to participate, share, create, and exchange information through various forums and social networks. The rapid increase in social media activity causes an increase in the number of comments on social media. This is prone to triggering debate due to the easy formation of open discussions between social media users. However, the debate often triggers the emergence of negative things, causing great fights on social media. Social media users often use comments containing toxic words to argue and corner a party or group. This study conducted an experiment to detect comments containing toxic sentences on social media in Indonesia using a Pre-Trained Model that was trained for Indonesian. This study performed a multilabel classification and evaluated the classification results generated by the Multilingual BERT (MBERT), IndoBERT, and Indo Roberta Small models. The optimal result of this study is to use the IndoBERT model with an F1 Score of 0.8897.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call