Indonesian Hate Speech Text Classification Using Improved K-Nearest Neighbor with TF-IDF-ICSρF

Nova Adi Saputra, Nurul Mega Saraswati, Khurotul Aeni

doi:10.15294/sji.v11i1.48085

Abstract

Purpose: Freedom of expression on social media allows users to post messages that may offend or harm others; in Indonesia, such speech is restricted by the Electronic Information and Transactions Law (UU ITE). This research aims to find an effective method for classifying hate speech text, particularly in Indonesian and across many categories, with the goal of helping to reduce such cases.

Methods: This study used 1,000 records collected from Twitter, labeled with five categories: religion, race, physical, gender, and other (invective or slander). The process comprised several preprocessing steps, data transformation using TF-IDF-ICSρF term weighting, and data mining using an Improved KNN algorithm. The results were then compared with those of the TF-IDF and KNN methods to evaluate the differences.

Result: TF-IDF-ICSρF with Improved KNN achieved an average accuracy of 88.11%, which is 17.81 percentage points higher than the 70.30% obtained by TF-IDF with K-Nearest Neighbor on the same data and parameters.

Novelty: Based on the comparison results, the TF-IDF-ICSρF and Improved KNN methods can effectively classify hate speech sentences that have many labels with fairly good accuracy.
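To make the comparison baseline concrete, the following is a minimal sketch of plain TF-IDF weighting followed by K-Nearest Neighbor voting. It illustrates only the TF-IDF and KNN baseline mentioned in the abstract, not the TF-IDF-ICSρF weighting or the Improved KNN algorithm, whose formulas are not given here. The function names, the toy English tokens, the `log(N/df) + 1` IDF variant, cosine similarity, and `k=3` are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Build one sparse TF-IDF dict per tokenized document, plus the IDF table."""
    n = len(docs)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for doc in docs for term in set(doc))
    # Assumed IDF variant: log(N / df) + 1, so terms in every document keep some weight.
    idf = {term: math.log(n / count) + 1.0 for term, count in df.items()}
    vectors = [{t: c * idf[t] for t, c in Counter(doc).items()} for doc in docs]
    return vectors, idf


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def knn_predict(query_tokens, train_vectors, train_labels, idf, k=3):
    """Label a tokenized query by majority vote among its k most similar training docs."""
    # Terms unseen during training have no IDF and are ignored.
    query = {t: c * idf[t] for t, c in Counter(query_tokens).items() if t in idf}
    ranked = sorted(
        zip(train_vectors, train_labels),
        key=lambda pair: cosine(query, pair[0]),
        reverse=True,
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Hypothetical, already-preprocessed toy documents (not data from the study).
train_docs = [
    ["faith", "worship", "insult"],
    ["faith", "prayer", "mock"],
    ["ethnic", "tribe", "insult"],
    ["ethnic", "origin", "mock"],
    ["body", "weight", "insult"],
    ["body", "height", "mock"],
]
train_labels = ["religion", "religion", "race", "race", "physical", "physical"]

train_vectors, idf = tfidf_vectors(train_docs)
print(knn_predict(["faith", "prayer", "insult"], train_vectors, train_labels, idf, k=3))
```

In this sketch every neighbor casts an equal vote and the same `k` is used for every class; the abstract's Improved KNN and class-aware term weighting would replace those two pieces.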

Journal: Scientific Journal of Informatics
Publication Date: Feb 25, 2024
License type: cc-by

Similar Papers

Anticipation of the ITE Law and Reconciliation of Its Forms Freedom of Expression through the E-Hights Website
Wita Setyaningrum ... Retno Damarina
Jurnal Hukum Novelty | VOL. 13
24 Dec 2022

The Role of Criminal Law in Overcoming Negative Content on Social Media: A Perspective on Information Law and Electronic Transactions
Tonny Laos ... Nur Handayati
International Journal of Scientific Research in Science and Technology | VOL. -
10 Sep 2023

جريمة التشهير عبر وسائل التواصل الاجتماعي في نظر القانون الوضعي الإندونيسي والشريعة الإسلامية [The Crime of Defamation via Social Media from the Perspective of Indonesian Positive Law and Islamic Sharia]
Dwi Pramono
Al-Zahra : Journal for Islamic and Arabic Studies | VOL. 19
12 Jun 2022

KONTEN ILEGAL (ILLEGAL CONTENT): SEBUAH TINDAK PIDANA MENURUT UNDANG-UNDANG INFORMASI DAN TRANSAKSI ELEKTRONIK (UU ITE) [Illegal Content: A Criminal Act under the Electronic Information and Transactions Law (UU ITE)]
JURNAL SISTEM INFORMASI UNIVERSITAS SURYADARMA | VOL. 11
03 Jun 2014
