Domain Bias in Fake News Datasets Consisting of Fake and Real News Pairs

Shingo Kato,Daisuke Ikeda,Linshuo Yang

doi:10.1109/iiaiaai55812.2022.00029

Abstract

News intentionally containing false information–known as "fake news"–is common on the Internet and often causes social disruption. In order to solve it, research on automatic detection of fake news using supervised learning has been active. Although the accuracy is improving, a major challenge for practical application remains: models can not work well for news in unknown fields (domains) due to domain biases. The goal of this study is to mitigate these domain biases and improve the accuracy of cross-domain fake news detection, which tests news from unknown domains. We firstly try to mitigate the bias by masking noun phrases which are considered a major source of domain bias. However, masking has not improved accuracy. Therefore, we point out that the dataset in this study has the property that it always contains pairs of fake and real news on the exact same topic. In this paper, we focus on this property of dataset and examine how it may affect domain bias and accuracy. Comparative experiments show that accuracy is higher when trained on a dataset with the property shown in this study. We suggest that a fake news dataset consisting of paired news could be effective for cross-domain detection.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Domain Bias in Fake News Datasets Consisting of Fake and Real News Pairs

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

A First Step Towards Combating Fake News over Online Social Media
Kuai Xu ... Bo Yang
-
Kuai Xu, et. al.Kuai Xu ... Bo Yang
01 Jan 2018
01 Jan 2018

The role of analytical reasoning and source credibility on the evaluation of real and fake full-length news articles
Didem Pehlivanoglu ... Farha Deceus
Cognitive Research: Principles and Implications | VOL. 6
Didem Pehlivanoglu, et. al.Didem Pehlivanoglu ... Farha Deceus
31 Mar 2021
Cognitive Research: Principles and Implications | VOL. 6

Human Brains Can’t Detect Fake News: A Neuro-Cognitive Study of Textual Disinformation Susceptibility
Cagri Arisoy ... Nitesh Saxena
-
Cagri Arisoy, et. al.Cagri Arisoy ... Nitesh Saxena
22 Aug 2022
22 Aug 2022

The PolitiFact-Oslo Corpus: A New Dataset for Fake News Analysis and Detection
Nele Põldvere ... Aleena Thomas
Information | VOL. 14
Nele Põldvere, et. al.Nele Põldvere ... Aleena Thomas
23 Nov 2023
Information | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Domain Bias in Fake News Datasets Consisting of Fake and Real News Pairs

Abstract

Talk to us

Similar Papers