Automatic Fake News Detection for Romanian Online News

Marius Cristian Buzea,Stefan Trausan-Matu,Traian Rebedea

doi:10.3390/info13030151

Abstract

This paper proposes a supervised machine learning system to detect fake news in online sources published in Romanian. Additionally, this work presents a comparison of the obtained results by using recurrent neural networks based on long short-term memory and gated recurrent unit cells, a convolutional neural network, and a Bidirectional Encoder Representations from Transformers (BERT) model, namely RoBERT, a pre-trained Romanian BERT model. The deep learning architectures are compared with the results achieved by two classical classification algorithms: Naïve Bayes and Support Vector Machine. The proposed approach is based on a Romanian news corpus containing 25,841 true news items and 13,064 fake news items. The best result is over 98.20%, achieved by the convolutional neural network, which outperforms the standard classification methods and the BERT models. Moreover, based on irony detection and sentiment analysis systems, additional details are revealed about the irony phenomenon and sentiment analysis field which are used to tackle fake news challenges.

Highlights

Over the recent years, artificial intelligence (AI) brought important changes in the domain of information technologies and architectures, such as using and developing intelligent transportation systems, virtual personal assistants, robotic surgery, and maybe with the greatest impact on our lives, natural language processing [1].Nowadays, due to the internet, the quality and quantity of the news increases every day, and the way that the consumer accesses and manages daily online information is constantly changing
This paper focuses on analyzing the performance of several models for fake news detection in Romanian by using neural network architectures such as long short-term memory (LSTM), a convolutional neural network (CNN), gated recurrent units (GRU), Bidirectional Encoder Representations from Transformers (BERT), and standard classifiers such as Support Vector Machine (SVM) and Naïve Bayes (NB)
This paper proposes a method that receives as input pre-processed news to reduce the chances of underfitting or overfitting and performs several analyses and transformations, using the term frequency-inverse document frequency (TF-IDF) for feature extraction

Summary

Introduction

Artificial intelligence (AI) brought important changes in the domain of information technologies and architectures, such as using and developing intelligent transportation systems, virtual personal assistants, robotic surgery, and maybe with the greatest impact on our lives, natural language processing [1]. Fake news can function as propaganda or misinformation, but it always appeals to the emotions of the public and the intent to cover rational responses, analysis, and comparison of information from several sources, encouraging inflammation and outrage and can lead to conspiracy theories and partisan biased content that negatively affects social security. There are other ways to address misinformation such as conspiracies, discrediting, emotion, or social media feeds that may influence democratic processes [6]. In this paper, a statistical analysis for a dataset of online articles with real and fake classes is presented. A sentiment analysis and irony detection systems are applied to our datasets, providing important information in the process of detecting fake news.

Related Work

Dataset Description

Data Pre-Processing

Experiments

Deep Learning Architectures

Transformer Architectures

Results and Discussion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Information	Publication Date: Mar 14, 2022
Citations: 19	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Automatic Fake News Detection for Romanian Online News

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Information

Lead the way for us

Similar Papers

Engineering Document Summarization Using Sentence Representations Generated by Bidirectional Language Model
Yan Jin ... Yunjian Qiu
-
Yan Jin, et. al.Yan Jin ... Yunjian Qiu
17 Aug 2021
17 Aug 2021

Multilingual emoji prediction using BERT for sentiment analysis
Toshiki Tomihira ... Atsushi Otsuka
International Journal of Web Information Systems | VOL. 16
Toshiki Tomihira, et. al.Toshiki Tomihira ... Atsushi Otsuka
21 Sep 2020
International Journal of Web Information Systems | VOL. 16

Sentimental Analysis using Bert Algorithm over LSTM
Mohammed Athar Rangila ... Komal Kamble
International Journal of Advanced Research in Science, Communication and Technology | VOL. -
Mohammed Athar Rangila, et. al. Mohammed Athar Rangila ... Komal Kamble
24 Oct 2022
International Journal of Advanced Research in Science, Communication and Technology | VOL. -

Classifying social determinants of health from unstructured electronic health records using deep learning-based natural language processing
Sifei Han ... Fuchiang R Tsui
Journal of Biomedical Informatics | VOL. 127
Sifei Han, et. al.Sifei Han ... Fuchiang R Tsui
07 Jan 2022
Journal of Biomedical Informatics | VOL. 127

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Automatic Fake News Detection for Romanian Online News

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Information