Enhancing hate speech detection in Indonesian using abusive words lexicon

Endang Wahyu Pamungkas,Sohail Akhtar,Dian Purworini,Divi Galih Prasetyo Putri

doi:10.11591/ijeecs.v33.i1.pp450-462

Endang Wahyu Pamungkas, Sohail Akhtar + Show 2 more

Open Access

https://doi.org/10.11591/ijeecs.v33.i1.pp450-462

Copy DOI

Abstract

Hate speech is a major challenge in Indonesia, a diverse country with multiple languages and a dynamic online landscape. This research explores the phenomenon of hate speech and its detection, particularly in language contexts with limited resources. We introduce a new abusive words lexicon, created by collecting words from various sources, adapted for Indonesian, Javanese and Sundanese. Our study investigates the practical implementation of this lexicon. We conducted extensive experiments using different datasets and machine learning models, aiming to improve hate speech detection. The results consistently show a positive impact of the lexicon, which significantly improves detection, especially in languages with fewer resources. But this research paves the way for further exploration. The lexicon can be expanded, broadening its scope. Additionally, we suggest investigating more sophisticated models, such as transformerbased models, to more effectively detect hate speech. In a world where hate speech is a growing problem, our research provides valuable insights and tools to combat it effectively in Indonesia and other countries.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Enhancing hate speech detection in Indonesian using abusive words lexicon

Abstract

Talk to us

Similar Papers

More From: Indonesian Journal of Electrical Engineering and Computer Science

Lead the way for us

Journal: Indonesian Journal of Electrical Engineering and Computer Science	Publication Date: Jan 1, 2024
License type: CC BY-NC 4.0

Similar Papers

Hierarchical Sentiment Analysis Framework for Hate Speech Detection: Implementing Binary and Multiclass Classification Strategy
Faria Naznin ... Shahran Rahman Alve
Cognizance Journal of Multidisciplinary Studies | VOL. 4
Faria Naznin, et. al.Faria Naznin ... Shahran Rahman Alve
30 Aug 2024
Cognizance Journal of Multidisciplinary Studies | VOL. 4

Sinhala Hate Speech Detection in Social Media using Text Mining and Machine learning
H.M.S.T Sandaruwan ... S.A.S Lorensuhewa
-
H.M.S.T Sandaruwan, et. al.H.M.S.T Sandaruwan ... S.A.S Lorensuhewa
01 Sep 2019
01 Sep 2019

Roman Urdu Hate Speech Detection Using Transformer-Based Model for Cyber Security Applications.
Muhammad Bilal ... Atif Khan
Sensors | VOL. 23
Muhammad Bilal, et. al.Muhammad Bilal ... Atif Khan
12 Apr 2023
Sensors | VOL. 23

Evaluating Machine Learning Techniques for Detecting Offensive and Hate Speech in South African Tweets
Oluwafemi Oriola ... Eduan Kotze
IEEE Access | VOL. 8
Oluwafemi Oriola, et. al.Oluwafemi Oriola ... Eduan Kotze
01 Jan 2020
IEEE Access | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Enhancing hate speech detection in Indonesian using abusive words lexicon

Abstract

Talk to us

Similar Papers

More From: Indonesian Journal of Electrical Engineering and Computer Science