Building an Optimal Dataset for Arabic Fake News Detection

Mohammad A Bsoul, Abdallah Qusef, Saleh Abu-Soud

doi:10.1016/j.procs.2022.03.088

Open Access

https://doi.org/10.1016/j.procs.2022.03.088

Abstract

Fake news detection for Arabic news has drawn some attention recently. However, the number of such studies is limited due to the lack of datasets that can be used to perform them. Clickbait detection is typically linked to fake news detection, as clickbait is effective in spreading fake news. The lack of Arabic-language datasets for studying clickbait detection models is also evident. This paper presents a dataset of Arabic clickbait news for the first time. The purpose of this dataset is to enable the automatic classification of news headlines as “Clickbait” or “Not Clickbait” using a machine learning model. More than 3000 news records are sampled from five months of tweets by 24 Jordanian news publishers. All sampled news records are labeled by three annotators, which resulted in 18% of the records being labeled as clickbait. The annotators unanimously agreed on the class of about 81% of the labeled news records. To showcase the usability of the resulting dataset in machine learning, Logistic Regression, Support Vector Machine, Random Forest, Naïve Bayes, Stochastic Gradient Descent, Nearest Neighbor, and Decision Tree are applied to this dataset. These models produced Macro F1-scores of up to 0.81, indicating that the automatic detection of clickbait news headlines using machine learning is feasible.
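The two quantities the abstract reports, the unanimous-agreement rate among three annotators and the Macro F1-score, can be illustrated with a minimal Python sketch. The toy votes and predictions below are made up, the short label `"not"` stands in for “Not Clickbait”, and resolving disagreements by majority vote is an assumption; the abstract does not state how the final label was chosen.

```python
from collections import Counter


def majority_label(votes):
    """Label chosen by most annotators (assumed rule; no ties with 3 votes, 2 classes)."""
    return Counter(votes).most_common(1)[0][0]


def unanimous_rate(all_votes):
    """Fraction of records on which every annotator chose the same label."""
    return sum(len(set(v)) == 1 for v in all_votes) / len(all_votes)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1, so the minority class counts equally."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for label in labels:
        tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
        fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
        fn = sum(t == label and p != label for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Hypothetical annotations: one tuple of three votes per headline.
votes = [
    ("clickbait", "clickbait", "clickbait"),
    ("not", "not", "not"),
    ("not", "clickbait", "not"),
    ("not", "not", "not"),
    ("clickbait", "clickbait", "not"),
]
gold = [majority_label(v) for v in votes]

# Hypothetical classifier output for the same five headlines.
predicted = ["clickbait", "not", "not", "not", "not"]

print("unanimous agreement:", unanimous_rate(votes))
print("macro F1:", round(macro_f1(gold, predicted), 4))
```

Macro averaging is a natural choice when only about 18% of records are clickbait, because a model that ignores the minority class is penalized even if its overall accuracy is high.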

Journal: Procedia Computer Science
Publication Date: Jan 1, 2022
Citations: 7
License type: CC BY-NC-ND
