A Comparative Study of Transformer-based Models for Hate-Speech Detection in English-Kiswahili Code-Switched Social Media Text

doi:10.30534/ijatcse/2024/011352024

Abstract

The transformer architecture, introduced in 2017 by researchers at Google, has revolutionized natural language processing across many tasks, including text classification. It forms the basis of later models, such as those used for hate speech detection in code-switched text. In this research, we conduct a comparative study of transformer-based models for hate speech detection in English-Kiswahili code-switched text. The models were compared first as feature extractors feeding a traditional classifier and then as end-to-end classifiers. The three multilingual transformer-based models compared are mBERT, mDistilBERT and XLM-RoBERTa, with a support vector machine (SVM) as the traditional classifier for the extracted features. The study used the HateSpeech_Kenya dataset, sourced from Kaggle. As a feature extractor, mBERT produced the hidden states that trained the highest-performing SVM, with an accuracy of 0.5461 and a macro F1 score of 0.40. As end-to-end classifiers, XLM-RoBERTa achieved the highest accuracy of the three models on a balanced dataset, 0.6069, with a macro F1 score of 0.49. In contrast, mBERT achieved the highest accuracy on an imbalanced dataset, 0.7820, with a macro F1 score of 0.53. The comparison shows that transformer-based models used as end-to-end classifiers generally outperform the same models used as feature extractors for traditional classifiers, because training the models directly allows them to learn more task-specific features. Furthermore, the varying performance across balanced and imbalanced datasets highlights the need for careful model selection based on the dataset characteristics and specific task requirements.
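
The gap between accuracy and macro F1 reported on the imbalanced dataset can be illustrated with a minimal sketch. The class names and the toy labels below are assumptions made for illustration only; they are not taken from the HateSpeech_Kenya dataset or from the study's results. The sketch computes both metrics from scratch for a classifier that always predicts the majority class.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def macro_f1(y_true, y_pred):
    """Unweighted mean of the per-class F1 scores.

    Every class counts equally, however rare it is. A class with no
    true positives gets an F1 of 0.0.
    """
    labels = sorted(set(y_true) | set(y_pred))
    f1_scores = []
    for label in labels:
        tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
        fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
        fn = sum(t == label and p != label for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0.0 when the denominator is 0.
        f1_scores.append(2 * tp / denom if denom else 0.0)
    return sum(f1_scores) / len(f1_scores)


# Hypothetical imbalanced toy set: 8 "neutral", 1 "offensive", 1 "hate".
y_true = ["neutral"] * 8 + ["offensive", "hate"]
# A degenerate classifier that always predicts the majority class.
y_pred = ["neutral"] * 10

print(f"accuracy = {accuracy(y_true, y_pred):.2f}")  # accuracy = 0.80
print(f"macro F1 = {macro_f1(y_true, y_pred):.2f}")  # macro F1 = 0.30
```

In this toy case the accuracy looks strong while the macro F1 is low, because the two minority classes are never predicted. This is one reason to read accuracy and macro F1 together when the class distribution is skewed.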
