Adapting Open Domain Fact Extraction and Verification to COVID-FACT through In-Domain Language Modeling

Zhenghao Liu, Si Sun, Zhuyun Dai, Maosong Sun, Zhiyuan Liu, Chenyan Xiong

doi:10.18653/v1/2020.findings-emnlp.216

Abstract

With the COVID-19 epidemic, verifying scientifically false online information, such as fake news and maliciously fabricated statements, has become crucial. However, the lack of training data in the scientific domain limits the performance of fact verification models. This paper proposes an in-domain language modeling method for fact extraction and verification systems. We introduce SciKGAT, which combines the advantages of open-domain literature search, state-of-the-art fact verification systems, and in-domain medical knowledge through language modeling. Our experiments on SCIFACT, a dataset of expert-written scientific fact verification, show that SciKGAT achieves a 30% absolute improvement in precision. Our analyses show that this improvement stems from our in-domain language model, which picks up more related evidence pieces and verifies facts more accurately. Our code and data are released via GitHub.

Highlights

This paper proposes an in-domain language modeling method for fact extraction and verification systems
Some work (Beltagy et al., 2019; Lee et al., 2020) transfers medical domain knowledge into pre-trained language models for better medical semantic understanding, which provides a potential way to deal with the COVID-FACT checking problem
We evaluate the impacts of the in-domain language model on individual fact extraction and verification components of Scientific KGAT (SciKGAT)

Summary

Introduction

This paper proposes an in-domain language modeling method for fact extraction and verification systems. Our in-domain language modeling improves fact verification performance by more than 10% absolute F1 and 30% absolute precision (from 46.6% to 76%) over the previous state of the art on SCIFACT. This improvement shows that our model provides a set of solutions for low-resource fact verification tasks, such as those concerning COVID-19. The small-scale training data of SCIFACT may [...]

Existing fact extraction and verification models usually employ a three-step pipeline system (Chen et al., 2017): document retrieval (abstract retrieval), sentence selection (rationale selection), and fact verification (Thorne et al., 2018; Wadden et al., 2020).
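The three-step pipeline described above can be illustrated with a minimal sketch. This is not SciKGAT and not the authors' code: token overlap stands in for the neural retrieval and selection models, and a negation-cue heuristic stands in for the fact verification model. All function names, the label strings, and the toy corpus are hypothetical and exist only to show how the three stages connect.

```python
import re
from typing import Dict, List, Set

# Toy corpus (invented for illustration): abstract id -> list of sentences.
TOY_CORPUS: Dict[str, List[str]] = {
    "doc_a": ["Compound A lowers blood pressure in mice.",
              "The experiment lasted four weeks."],
    "doc_b": ["Compound B does not reduce fever in adults.",
              "Doses varied across sites."],
}

# Assumption: a tiny cue list; a real verifier would be a trained model.
NEGATION_CUES = {"not", "no", "never"}


def tokens(text: str) -> Set[str]:
    """Lowercased alphanumeric tokens of a text."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def overlap(claim: str, text: str) -> float:
    """Fraction of claim tokens that also occur in the text."""
    claim_tokens = tokens(claim)
    if not claim_tokens:
        return 0.0
    return len(claim_tokens & tokens(text)) / len(claim_tokens)


def retrieve_abstracts(claim: str, corpus: Dict[str, List[str]], k: int = 1) -> List[str]:
    """Step 1, document (abstract) retrieval: top-k abstracts by overlap."""
    ranked = sorted(corpus,
                    key=lambda doc_id: overlap(claim, " ".join(corpus[doc_id])),
                    reverse=True)
    return ranked[:k]


def select_rationales(claim: str, sentences: List[str], k: int = 1) -> List[str]:
    """Step 2, sentence (rationale) selection: top-k sentences with any overlap."""
    ranked = sorted(sentences, key=lambda s: overlap(claim, s), reverse=True)
    return [s for s in ranked[:k] if overlap(claim, s) > 0]


def verify(claim: str, rationales: List[str]) -> str:
    """Step 3, fact verification: placeholder heuristic on negation cues."""
    if not rationales:
        return "NOT_ENOUGH_INFO"
    claim_negated = bool(tokens(claim) & NEGATION_CUES)
    evidence_negated = any(tokens(r) & NEGATION_CUES for r in rationales)
    return "SUPPORTS" if claim_negated == evidence_negated else "REFUTES"


def run_pipeline(claim: str, corpus: Dict[str, List[str]]) -> dict:
    """Chain the three steps and return the intermediate outputs and the label."""
    doc_ids = retrieve_abstracts(claim, corpus)
    rationales = [s for d in doc_ids for s in select_rationales(claim, corpus[d])]
    return {"doc_ids": doc_ids, "rationales": rationales,
            "label": verify(claim, rationales)}
```

Calling `run_pipeline("Compound B reduces fever in adults", TOY_CORPUS)` retrieves `doc_b`, selects its first sentence as the rationale, and passes it to the verification step. In a real system each stage would be replaced by a learned model; the sketch only fixes the interfaces between stages.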

Publication Date: Jan 1, 2020
Citations: 13
License type: CC BY

Similar Papers

Evaluating adversarial attacks against multiple fact verification systems
James Thorne ... Christos Christodoulopoulos
01 Jan 2019

EnFVe: An Ensemble Fact Verification Pipeline
John Joy Kurian ... Avinash Ronanki
01 Dec 2020

Adversarial Performance Evaluation and Analysis on Recent Strong Models for Fact Verification
Hao Wang ... Yong Dou
01 Oct 2022

Fine-grained Fact Verification with Kernel Graph Attention Network
Zhenghao Liu ... Zhiyuan Liu
01 Jan 2020
