Fake news detection in Urdu language using machine learning.

Muhammad Shoaib Farooq,Furqan Rustam,Ansar Naseem,Imran Ashraf

doi:10.7717/peerj-cs.1353

Abstract

With the rise of social media, the dissemination of forged content and news has been on the rise. Consequently, fake news detection has emerged as an important research problem. Several approaches have been presented to discriminate fake news from real news, however, such approaches lack robustness for multi-domain datasets, especially within the context of Urdu news. In addition, some studies use machine-translated datasets using English to Urdu Google translator and manual verification is not carried out. This limits the wide use of such approaches for real-world applications. This study investigates these issues and proposes fake news classier for Urdu news. The dataset has been collected covering nine different domains and constitutes 4097 news. Experiments are performed using the term frequency-inverse document frequency (TF-IDF) and a bag of words (BoW) with the combination of n-grams. The major contribution of this study is the use of feature stacking, where feature vectors of preprocessed text and verbs extracted from the preprocessed text are combined. Support vector machine, k-nearest neighbor, and ensemble models like random forest (RF) and extra tree (ET) were used for bagging while stacking was applied with ET and RF as base learners with logistic regression as the meta learner. To check the robustness of models, fivefold and independent set testing were employed. Experimental results indicate that stacking achieves 93.39%, 88.96%, 96.33%, 86.2%, and 93.17% scores for accuracy, specificity, sensitivity, MCC, ROC, and F1 score, respectively.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PeerJ Computer Science	Publication Date: May 23, 2023
Citations: 10	License type: CC BY 4.0

R Discovery Prime

Fake news detection in Urdu language using machine learning.

Abstract

Published Version

Talk to us

Similar Papers

More From: PeerJ Computer Science

Lead the way for us

Similar Papers

Fake News Detection Using Passive-Aggressive Classifier and Other Machine Learning Algorithms
K Nagashri ... J Sangeetha
-
K Nagashri, et. al.K Nagashri ... J Sangeetha
01 Jan 2020
01 Jan 2020

Exploring the Effect of N-grams with BOW and TF-IDF Representations on Detecting Fake News
Amal Esmail Qasem ... Mohammad Sajid
-
Amal Esmail Qasem, et. al.Amal Esmail Qasem ... Mohammad Sajid
25 Oct 2022
25 Oct 2022

Detection of Fake News Using Transformer Model
Momina Qazi ... Mazhar Ali
-
Momina Qazi, et. al.Momina Qazi ... Mazhar Ali
01 Jan 2020
01 Jan 2020

COVID-19 Fake News Detection in Malaysia – A Supervised Approach
Ramakrishnan Kalaimagal ... Balakrishnan Vimala
-
Ramakrishnan Kalaimagal, et. al.Ramakrishnan Kalaimagal ... Balakrishnan Vimala
23 Feb 2023
23 Feb 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Fake news detection in Urdu language using machine learning.

Abstract

Published Version

Talk to us

Similar Papers

More From: PeerJ Computer Science