Abstract

The availability of social media, blogs, and websites to everyone creates a lot of problems. False news is a critical issue that can affect individuals or entire countries. Fake news can be created and shared all over the world. The 2016 presidential election in the United States illustrates that problem. As a result, controlling social media is essential. Machine learning (ML) algorithms help to detect fake news automatically. This article proposes a framework for detecting fake news based on feature extraction and feature selection algorithms and a set of voting classifiers. The proposed system distinguishes fake news from real news. First, we preprocessed the data taking unnecessary characters and numbers and reducing the words in the dictionary (lemmatization). Second, we extracted some important features using two feature extraction types: the term frequency-inverse document frequency (TF-IDF) technique and the DOC2VEC algorithm, a word embedding technique. Third, the extracted characteristics were reduced with the help of the chi-square algorithm and the analysis of variance (ANOVA) algorithm. We used three data sets that are published online: Media-Eval, Fake-or-Real-News, and ISOT. To evaluate the proposed framework, we used five different performance metrics: accuracy (ACC), the area under the curve (AUC), precision, recall, and f1-score. Our system achieved 94.6% of ACC for the Fake-or-Real dataset. For the Media-Eval dataset, the system achieved 92.3% of ACC. For the ISOT dataset, the system achieved 100% of ACC. We contrast the proposed framework with several other classification algorithms. The experimental results show that the proposed framework outperforms the existing works in terms of ACC by 0.2% for the ISOT dataset.

Highlights

  • One of the consequences of technology is fake news. It is misinformation or misleading information offered as facts that can affect a person's opinion. This false information has several goals; organizations can use it for financial purposes (e.g., Facebook pages used it to spread fake news leading to specific ads) or for political purposes

  • The authors observed that the untrusted website visits are not observed by 3.3% untrusted news versus 6.2% trust news for Google, or 1% untrusted versus 1.5% trust news for Twitter [1]

  • This paper enhances the performance of the conventional Machine learning (ML) algorithm in detecting fake news because the dataset utilized didn't have a large enough amount of data to feed the deep learning algorithm [8]

Read more

Summary

INTRODUCTION

One of the consequences of technology is fake news It is misinformation or misleading information offered as facts that can affect a person's opinion. For example, is leading the fight to tackle fake news through education They teach primary school students how to combat false news and develop media literacy skills. Because misinformation tries to spread false claims, knowledge-based approaches use international sources to fact-check the truthfulness in any news content. This paper enhances the performance of the conventional ML algorithm in detecting fake news because the dataset utilized didn't have a large enough amount of data to feed the deep learning algorithm [8].

RELATED WORK
Feature Reduction
EXPERIMENTAL RESULTS
Hardware and software specifications
Methods
CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call