AmbiFC: Fact-Checking Ambiguous Claims with Evidence

Max Glockner,James Thorne,Andreas Vlachos,Gisela Vallejo,Ieva Staliūnaitė,Iryna Gurevych

doi:10.1162/tacl_a_00629

Max Glockner, James Thorne + Show 4 more

Open Access

https://doi.org/10.1162/tacl_a_00629

Copy DOI

Abstract

Abstract Automated fact-checking systems verify claims against evidence to predict their veracity. In real-world scenarios, the retrieved evidence may not unambiguously support or refute the claim and yield conflicting but valid interpretations. Existing fact-checking datasets assume that the models developed with them predict a single veracity label for each claim, thus discouraging the handling of such ambiguity. To address this issue we present AmbiFC,1 a fact-checking dataset with 10k claims derived from real-world information needs. It contains fine-grained evidence annotations of 50k passages from 5k Wikipedia pages. We analyze the disagreements arising from ambiguity when comparing claims against evidence in AmbiFC, observing a strong correlation of annotator disagreement with linguistic phenomena such as underspecification and probabilistic reasoning. We develop models for predicting veracity handling this ambiguity via soft labels, and find that a pipeline that learns the label distribution for sentence-level evidence selection and veracity prediction yields the best performance. We compare models trained on different subsets of AmbiFC and show that models trained on the ambiguous instances perform better when faced with the identified linguistic phenomena.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

AmbiFC: Fact-Checking Ambiguous Claims with Evidence

Abstract

Talk to us

Similar Papers

More From: Transactions of the Association for Computational Linguistics

Lead the way for us

Journal: Transactions of the Association for Computational Linguistics	Publication Date: Jan 9, 2024
License type: CC BY 4.0

Similar Papers

Training Sound Event Detection with Soft Labels from Crowdsourced Annotations
Irene Martín-Morató ... Annamaria Mesaros
-
Irene Martín-Morató, et. al.Irene Martín-Morató ... Annamaria Mesaros
04 Jun 2023
04 Jun 2023

Every Rating Matters: Joint Learning of Subjective Labels and Individual Annotators for Speech Emotion Classification
Huang-Cheng Chou ... Chi-Chun Lee
-
Huang-Cheng Chou, et. al.Huang-Cheng Chou ... Chi-Chun Lee
01 May 2019
01 May 2019

F-Similarity Preservation Loss for Soft Labels: A Demonstration on Cross-Corpus Speech Emotion Recognition
Biqiao Zhang ... Emily Mower Provost
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 33
Biqiao Zhang, et. al.Biqiao Zhang ... Emily Mower Provost
17 Jul 2019
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 33

Natural Language Semantics using Probabilistic Logic

-

01 Oct 2014
01 Oct 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

AmbiFC: Fact-Checking Ambiguous Claims with Evidence

Abstract

Talk to us

Similar Papers

More From: Transactions of the Association for Computational Linguistics