HAPI: An efficient Hybrid Feature Engineering-based Approach for Propaganda Identification in social media.

Akib Mohi Ud Din Khanday,Syed Tanzeel Rabani,Ahmed A Abd El-Latif,Qamar Rayees Khan,Mudasir Ahmad Wani

doi:10.1371/journal.pone.0302583

Abstract

Social media platforms serve as communication tools where users freely share information regardless of its accuracy. Propaganda on these platforms refers to the dissemination of biased or deceptive information aimed at influencing public opinion, encompassing various forms such as political campaigns, fake news, and conspiracy theories. This study introduces a Hybrid Feature Engineering Approach for Propaganda Identification (HAPI), designed to detect propaganda in text-based content like news articles and social media posts. HAPI combines conventional feature engineering methods with machine learning techniques to achieve high accuracy in propaganda detection. This study is conducted on data collected from Twitter via its API, and an annotation scheme is proposed to categorize tweets into binary classes (propaganda and non-propaganda). Hybrid feature engineering entails the amalgamation of various features, including Term Frequency-Inverse Document Frequency (TF-IDF), Bag of Words (BoW), Sentimental features, and tweet length, among others. Multiple Machine Learning classifiers undergo training and evaluation utilizing the proposed methodology, leveraging a selection of 40 pertinent features identified through the hybrid feature selection technique. All the selected algorithms including Multinomial Naive Bayes (MNB), Support Vector Machine (SVM), Decision Tree (DT), and Logistic Regression (LR) achieved promising results. The SVM-based HaPi (SVM-HaPi) exhibits superior performance among traditional algorithms, achieving precision, recall, F-Measure, and overall accuracy of 0.69, 0.69, 0.69, and 69.2%, respectively. Furthermore, the proposed approach is compared to well-known existing approaches where it overperformed most of the studies on several evaluation metrics. This research contributes to the development of a comprehensive system tailored for propaganda identification in textual content. Nonetheless, the purview of propaganda detection transcends textual data alone. Deep learning algorithms like Artificial Neural Networks (ANN) offer the capability to manage multimodal data, incorporating text, images, audio, and video, thereby considering not only the content itself but also its presentation and contextual nuances during dissemination.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

HAPI: An efficient Hybrid Feature Engineering-based Approach for Propaganda Identification in social media.

Abstract

Talk to us

Similar Papers

More From: PloS one

Lead the way for us

Journal: PloS one	Publication Date: Jul 10, 2024
License type: CC BY 4.0

Similar Papers

Exploring the Effect of N-grams with BOW and TF-IDF Representations on Detecting Fake News
Amal Esmail Qasem ... Mohammad Sajid
-
Amal Esmail Qasem, et. al.Amal Esmail Qasem ... Mohammad Sajid
25 Oct 2022
25 Oct 2022

Automatic Fake News Detector in Social Media Using Machine Learning and Natural Language Processing Approaches
J Srinivas ... P Varaprasada Rao
-
J Srinivas, et. al.J Srinivas ... P Varaprasada Rao
01 Jan 2020
01 Jan 2020

COVID-19 Fake News Detection in Malaysia – A Supervised Approach
Ramakrishnan Kalaimagal ... Soo Mun Chong
-
Ramakrishnan Kalaimagal, et. al.Ramakrishnan Kalaimagal ... Soo Mun Chong
23 Feb 2023
23 Feb 2023

Phony News Detection in Reddit Using Natural Language Techniques and Machine Learning Pipelines
Srinivas Jagirdar ... Venkata Subba K Reddy
International Journal of Natural Computing Research | VOL. 10
Srinivas Jagirdar, et. al.Srinivas Jagirdar ... Venkata Subba K Reddy
01 Jul 2021
International Journal of Natural Computing Research | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

HAPI: An efficient Hybrid Feature Engineering-based Approach for Propaganda Identification in social media.

Abstract

Talk to us

Similar Papers

More From: PloS one