Analisis DistilBERT dengan Support Vector Machine (SVM) untuk Klasifikasi Ujaran Kebencian pada Sosial Media Twitter

Naufal Azmi Verdikha,Reza Habid,Asslia Johar Latipah

doi:10.47002/metik.v7i2.583

Abstract

Hate speech is a significant issue in content management on social media platforms. Effective classification of hate speech plays a crucial role in maintaining a safe social media environment, combating discrimination, and protecting users. This study evaluates a hate speech classification model using SVM with linear and polynomial kernels. The dataset used consists of labeled Indonesian-language tweets. The importance of developing an effective classification model to address hate speech has led to the utilization of DistilBERT as a feature extraction method. However, DistilBERT has high-dimensional features, necessitating dimensionality reduction to reduce model complexity. Therefore, in this study, the PCA dimensionality reduction method is implemented with various scenarios of dimensionality, namely 10, 20, 30, 40, and 50. Evaluation is performed using F1-Score, and the entire study is evaluated using 10-fold cross-validation. The evaluation results indicate that in the scenario with a linear kernel, the model achieves the highest F1-Score of 0.75 in the 50-dimensional scenario. Meanwhile, in the scenario with a polynomial kernel, the model achieves the highest F1-Score of 0.7857 in the 50-dimensional scenario. These findings demonstrate that the use of a polynomial kernel with 50 dimensions yields the best performance in classifying hate speech.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Analisis DistilBERT dengan Support Vector Machine (SVM) untuk Klasifikasi Ujaran Kebencian pada Sosial Media Twitter

Abstract

Talk to us

Similar Papers

More From: METIK JURNAL

Lead the way for us

Journal: METIK JURNAL	Publication Date: Dec 30, 2023
License type: CC BY-SA 4.0

Similar Papers

Fast, Accurate and Robust Recognition Based On Local Normalized Linear Summation Kernel
Kazuhiro Hotta
-
Kazuhiro HottaKazuhiro Hotta
01 Dec 2007
01 Dec 2007

Combating the challenges of social media hate speech in a polarized society
Collins Udanor ... Chinatu C Anyanwu
Data Technologies and Applications | VOL. 53
Collins Udanor, et. al.Collins Udanor ... Chinatu C Anyanwu
13 Sep 2019
Data Technologies and Applications | VOL. 53

Heterogeneous Ensemble Structure based Universal Spam Profile Detection System for Social Media Networks
Vinod A M ... Sathish G C
International Journal of Recent Technology and Engineering (IJRTE) | VOL. 9
Vinod A M, et. al.Vinod A M ... Sathish G C
30 May 2020
International Journal of Recent Technology and Engineering (IJRTE) | VOL. 9

Putting the Toothpaste Back in the Tube: Against Online Hate Speech.
Brenda K Wiederhold
Cyberpsychology, behavior and social networking | VOL. 26
Brenda K WiederholdBrenda K Wiederhold
13 Jun 2023
Cyberpsychology, behavior and social networking | VOL. 26

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Analisis DistilBERT dengan Support Vector Machine (SVM) untuk Klasifikasi Ujaran Kebencian pada Sosial Media Twitter

Abstract

Talk to us

Similar Papers

More From: METIK JURNAL