Comparative Analysis of SVM, XGBoost and Neural Network on Hate Speech Classification

Suwarno Liang

doi:10.29207/resti.v5i5.3506

Abstract

In social media, it is found that hate speech is conveyed in the form of text, images and videos, as a result it can provoke certain people to do things that are against the law and harm other person. Therefore, it is necessary to make early detection of hate speech by utilizing machine learning algorithms. This study is to analyze the level of accuracy, precision, recall and F1-Score of 3 kinds of algorithms (SVM, XGBoost, and Neural Network) in the classification of hate speech, using datasets sourced from public hate speech on Twitter in Indonesian. The results of the analysis show that the SVM algorithm has a level of accuracy (83.2%), precision (83%), recall (83%) and F1-score (83%), SVM occupies the highest level compared to XGBoost and Neural Network, so the SVM algorithm can be considered for use in hate speech classification

Highlights

In social media, it is found that hate speech is conveyed in the form of text, images and videos, as a result it can provoke certain people to do things that are against the law and harm other person
Hasil analisis menunjukkan algoritma Support Vector Machine (SVM) memiliki tingkat accuracy (83.2%), precision (83%), recall (83%) dan F1-score (83%), SVM menduduki tertinggi dibanding XGBoost dan Neural Network, sehingga algoritma SVM dapat dipertimbangkan untuk digunakan dalam klasifikasi ujaran kebencian
Model akan melakukan klasifikasi hate speech berdasarkan teks tersebut dengan menggunakan model Support Vector Machine (SVM) yang sebelumnya dikonversi dalam bentuk .joblib

Summary

Pendahuluan penanganan kasus ujaran kebencian pada saat ini tidak

Banyak orang mengungkapkan pendapat di depan umum. Hal ini dikarenakan pemerintah Indonesia menjamin bahwa setiap warga negaranya memiliki hak kebebasan untuk menyuarakan pendapatnya [1]. Data Dari penelitian yang telah disebutkan diatas, tinjauan audio perlu diubah menjadi data teks, yang kemudian penelitian terdahulu dapat disimpulkan ke dalam Tabel digunakan untuk klasifikasi. Dilakukan stemming untuk teks-teks yang telah Penelitian ini menganalisis tiga algoritma yang optimal bersufiks dengan menggunakan Perpustakaan Sastrawi dalam melakukan klasifikasi ujaran kebencian. Penelitian yang dilakukan oleh [12] menunjukkan akan diubah menjadi teks, penulis akan menggunakan bahwa dengan adanya persebaran distribusi data dalam salah satu model pembelajaran mesin (SVM, setiap kelas akan mempengaruhi hasil accuracy model XGBOOST, atau Neural Network) yang memiliki hasil yang akan digunakan. Sehingga perlunya dilakukan terbaik akan digunakan untuk mengklasifikasikan ujaran pemerataan kelas, salah satunya dengan menggunakan kebencian. Memiliki dampak accuracy yang tidak signifikan Natural Language Toolkit (NLTK) digunakan dalam terhadap model dan atribut ini merupakan kata-kata proses ini untuk fungsi tokenisasi kata.

Stemming

Data Preparation

Case-Folding

Dataset Balancing

Neural Network

Evaluasi

Findings

Evaluasi Neural Network

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)	Publication Date: Oct 24, 2021
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Comparative Analysis of SVM, XGBoost and Neural Network on Hate Speech Classification

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)

Lead the way for us

Similar Papers

Hate Speech Classification in Indonesian Language Tweets by Using Convolutional Neural Network
Dewa Ayu Nadia Taradhita ... I Ketut Gede Darma Putra
Journal of ICT Research and Applications | VOL. 14
Dewa Ayu Nadia Taradhita, et. al.Dewa Ayu Nadia Taradhita ... I Ketut Gede Darma Putra
23 Feb 2021
Journal of ICT Research and Applications | VOL. 14

Enhanced Seagull Optimization with Natural Language Processing Based Hate Speech Detection and Classification
Yousef Asiri ... Romany F Mansour
Applied Sciences | VOL. 12
Yousef Asiri, et. al.Yousef Asiri ... Romany F Mansour
10 Aug 2022
Applied Sciences | VOL. 12

Exploration of Multi-corpus Learning for Hate Speech Classification in Low Resource Scenarios
Ashwin Geet D’Sa ... Dominique Fohr
-
Ashwin Geet D’Sa, et. al.Ashwin Geet D’Sa ... Dominique Fohr
01 Jan 2021
01 Jan 2021

Impact of Politically Biased Data on Hate Speech Classification
Maximilian Wich ... Georg Groh
-
Maximilian Wich, et. al.Maximilian Wich ... Georg Groh
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Comparative Analysis of SVM, XGBoost and Neural Network on Hate Speech Classification

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)