ArHateDetector: detection of hate speech from standard and dialectal Arabic Tweets

Ramzi Khezzar,Abdelrahman Moursi,Zaher Al Aghbari

doi:10.1007/s43926-023-00030-9

Ramzi Khezzar, Abdelrahman Moursi + Show 1 more

Open Access

https://doi.org/10.1007/s43926-023-00030-9

Copy DOI

Journal: Discover Internet of Things	Publication Date: Mar 20, 2023
Citations: 8	License type: open-access

Affiliation: University of Sharjah

Abstract

Hate speech has become a phenomenon on social media platforms, such as Twitter. These websites and apps that were initially designed to facilitate our expression of free speech, are sometimes being used to spread hate towards each other. In the Arab region, Twitter is a very popular social media platform and thus the number of tweets that contain hate speech is increasing rapidly. Many tweets are written either in standard, dialectal Arabic, or mix. Existing work on Arabic hate speech are targeted towards either standard or single dialectal text, but not both. To fight hate speech more efficiently, in this paper, we conducted extensive experiments to investigate Arabic hate speech in tweets. Therefore, we propose a framework, called arHateDetector, that detects hate speech in the Arabic text of tweets. The proposed arHateDetector supports both standard and several dialectal Arabic. A large Arabic hate speech dataset, called arHateDataset, was compiled from several Arabic standard and dialectal tweets. The tweets are preprocessed to remove the unwanted content. We investigated the use of recent machine learning and deep learning models such as AraBERT to detect hate speech. All classification models used in the investigation are trained with the compiled dataset. Our experiments shows that AraBERT outperformed the other models producing the best performance across seven different datasets including the compiled arHateDataset with an accuracy of 93%. CNN and LinearSVC produced 88% and 89% respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

ArHateDetector: detection of hate speech from standard and dialectal Arabic Tweets

Abstract

Talk to us

Similar Papers

More From: Discover Internet of Things

Lead the way for us

Similar Papers

Code-mixing unveiled: Enhancing the hate speech detection in Arabic dialect tweets using machine learning models.
Ali Alhazmi ... Christopher Ifeanyi Eke
PloS one | VOL. 19
Ali Alhazmi, et. al.Ali Alhazmi ... Christopher Ifeanyi Eke
17 Jul 2024
PloS one | VOL. 19

ABMM: Arabic BERT-Mini Model for Hate-Speech Detection on Social Media
Malik Almaliki ... Abdulqader M Almars
Electronics | VOL. 12
Malik Almaliki, et. al.Malik Almaliki ... Abdulqader M Almars
20 Feb 2023
Electronics | VOL. 12

Detection of Hate Speech in COVID-19-Related Tweets in the Arab Region: Deep Learning and Topic Modeling Approach.
Raghad Alshalan ... Heyam Al-Baity
Journal of Medical Internet Research | VOL. 22
Raghad Alshalan, et. al.Raghad Alshalan ... Heyam Al-Baity
08 Dec 2020
Journal of Medical Internet Research | VOL. 22

Bengali Hate Speech Detection in Public Facebook Pages
Nasif Istiak Remon ... Ranit Debnath Akash
-
Nasif Istiak Remon, et. al.Nasif Istiak Remon ... Ranit Debnath Akash
26 Feb 2022
26 Feb 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

ArHateDetector: detection of hate speech from standard and dialectal Arabic Tweets

Abstract

Talk to us

Similar Papers

More From: Discover Internet of Things