Abusive Language and Hate Speech Detection for Indonesian-Local Language in Social Media Text

Shofianina Dwi Ananda Putri,Indra Budi,Muhammad Okky Ibrohim

doi:10.1007/978-3-030-79757-7_9

Abstract

In social media, people are free to express their feelings and thoughts. However, people can also use abusive language and hate speech to insult or humiliate individuals or groups on social media, such as Twitter. Various detection methods have been developed to control the spread of abusive language and hate speech in Indonesia, but the detection process is still focused on monolingual. As a country with various ethnicities and cultures, Indonesia also has a variety of local languages. This study examines abusive language and hate speech detection on Twitter, which also contains five local languages, including Javanese, Sundanese, Madurese, Minangkabau, and Musi. In this work, we present a preliminary evaluation to find the best performance of machine learning methods in detecting abusive language and hate speech on Twitter as preliminary study for each local language. We use several machine learning algorithms, such as Naïve Bayes (NB), Support Vector Machine (SVM), and Random Forest Decision Tree (RFDT) as classifiers and TF-IDF weighted word n-gram and character-n gram as feature extraction. The experiments use the 5-Fold cross-validation approach and evaluated by measuring the F-1-Score. After the experiment, we have obtained the SVM classifier with word n-gram features show the best F-1-Score for each dataset.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Abusive Language and Hate Speech Detection for Indonesian-Local Language in Social Media Text

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Multi-label Hate Speech and Abusive Language Detection in Indonesian Twitter
Muhammad Okky Ibrohim ... Indra Budi
-
Muhammad Okky Ibrohim, et. al.Muhammad Okky Ibrohim ... Indra Budi
01 Jan 2019
01 Jan 2019

Multi-label Classification for Hate Speech and Abusive Language in Indonesian-Local Languages
Ajeng Dwi Asti ... Muhammad Okky Ibrohim
-
Ajeng Dwi Asti, et. al.Ajeng Dwi Asti ... Muhammad Okky Ibrohim
23 Oct 2021
23 Oct 2021

Hate Speech and Abusive Language and Abusive Language Detection in Twitter using Machine Learning
Sakshi Dhatrak ... Sakshi Bodke
International Journal of Advanced Research in Science, Communication and Technology | VOL. -
Sakshi Dhatrak, et. al. Sakshi Dhatrak ... Sakshi Bodke
09 Mar 2024
International Journal of Advanced Research in Science, Communication and Technology | VOL. -

Hierarchical Multi-label Classification to Identify Hate Speech and Abusive Language on Indonesian Twitter
Faizal Adhitama Prabowo ... Muhammad Okky Ibrohim
-
Faizal Adhitama Prabowo, et. al.Faizal Adhitama Prabowo ... Muhammad Okky Ibrohim
01 Sep 2019
01 Sep 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Abusive Language and Hate Speech Detection for Indonesian-Local Language in Social Media Text

Abstract

Talk to us

Similar Papers