Abstract

Social media is an effective tool for connecting with people and distributing information. However, many people use social media to spread hate speech and abusive language. In contrast to hate speech, abusive language is often used jokingly, with no intent to offend individuals or groups, even though it may contain profanity. As a result, the distinction between hate speech and abusive language is often blurred. Individuals who spread hate speech may face prosecution, as it carries legal implications. Previous research has focused on binary classification of hate speech versus normal tweets. This study aims to classify hate speech, abusive language, and normal messages on Indonesian Twitter. Several machine learning models, including logistic regression and BERT, are used for the classification task. Model performance is assessed using the F1-score. The results show that BERT models outperform the other models, with the BERT-indobenchmark model, which was pretrained on social media text data, achieving the highest F1-score of 85.59. This demonstrates that pretraining BERT on social media data significantly improves classification performance. A classification model that can distinguish hate speech from abusive language would help prevent the spread of hate speech, which carries legal implications.
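
As a rough illustration of the setup described above, the sketch below fine-tunes a BERT checkpoint for the three classes (hate speech, abusive language, normal) and reports an F1-score. The checkpoint name (indobenchmark/indobert-base-p1), the toy tweets, the weighted F1 averaging, and the hyperparameters are illustrative assumptions, not the authors' exact pipeline.

import numpy as np
from datasets import Dataset
from sklearn.metrics import f1_score
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

MODEL_NAME = "indobenchmark/indobert-base-p1"  # assumed checkpoint, not necessarily the paper's
LABELS = ["hate_speech", "abusive", "normal"]

# Toy stand-in for the annotated Indonesian tweet corpus.
texts = ["contoh tweet ujaran kebencian", "contoh tweet kasar sebagai candaan",
         "contoh tweet biasa", "contoh lain tweet biasa"]
labels = [0, 1, 2, 2]

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForSequenceClassification.from_pretrained(MODEL_NAME, num_labels=len(LABELS))

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, padding="max_length", max_length=128)

def compute_metrics(eval_pred):
    logits, gold = eval_pred
    preds = np.argmax(logits, axis=-1)
    # Weighted F1 is assumed here as the reported "F1-score".
    return {"f1": f1_score(gold, preds, average="weighted")}

dataset = Dataset.from_dict({"text": texts, "label": labels}).map(tokenize, batched=True)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="hate-speech-clf", num_train_epochs=3,
                           per_device_train_batch_size=16),
    train_dataset=dataset,
    eval_dataset=dataset,  # use a held-out split in practice
    compute_metrics=compute_metrics,
)
trainer.train()
print(trainer.evaluate())

The logistic regression baseline mentioned in the abstract would replace the BERT model with a bag-of-words or TF-IDF feature extractor and a linear classifier, evaluated with the same F1-score.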
