Abstract
Growing concern with online misinformation has encouraged NLP research on fact verification. Since writers often base their assertions on structured data, we focus here on verifying textual statements given evidence in tables. Starting from the Table Parsing (TAPAS) model developed for question answering (Herzig et al., 2020), we find that modeling table structure improves a language model pre-trained on unstructured text. Pre-training language models on English Wikipedia table data further improves performance. Pre-training on a question answering task with column-level cell rank information achieves the best performance. With improved pre-training and cell embeddings, this approach outperforms the state-of-the-art Numerically-aware Graph Neural Network table fact verification model (GNN-TabFact), increasing statement classification accuracy from 72.2% to 73.9% even without modeling numerical information. Incorporating numerical information with cell rankings and pre-training on a question-answering task increases accuracy to 76%. We further analyze accuracy on statements implicating single rows or multiple rows and columns of tables, on different numerical reasoning subtasks, and on generalizing to detecting errors in statements derived from the ToTTo table-to-text generation dataset.
Highlights
The rapid growth in the amount and sources of online textual content has raised concerns about misinformation and its potential harmful impacts on society when quickly spread to a massive audience
We propose to adapt the Table Parsing (TAPAS) model (Herzig et al., 2020), which has proven effective in question answering over tables, to model tables for fact verification
The TAPAS-Row-Col-Rank model pre-trained on the question answering task over tables achieves the best performance
Summary
The rapid growth in the amount and sources of online textual content has raised concerns about misinformation and its potential harmful impacts on society when quickly spread to a massive audience. Concerns about misinformation have stimulated extensive research on automatic fact verification, i.e., verifying whether a given textual statement is entailed or refuted by given evidence. Chen et al. (2019) introduced a large-scale dataset, TabFact, for verifying statements against structured evidence in tables. Traditional language models trained on unstructured text cannot be directly applied to learn representations of structured data. Detecting misinformation with structured evidence involves both linguistic inference and numerical reasoning such as addition, subtraction, sorting, and counting.
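To make the column-level cell rank idea concrete, the following is an illustrative sketch (not the paper's actual code) of how per-column numeric ranks could be computed before being fed to a model as rank embeddings. The function name `column_ranks` and the convention of rank 0 for non-numeric cells are assumptions for illustration.

```python
def column_ranks(column):
    """Return a 1-based rank per cell for numeric values in one table column.

    Non-numeric cells (e.g. "n/a") receive rank 0, i.e. no rank information.
    This mirrors the intuition behind column-level cell rank embeddings:
    the model sees each numeric cell's position in the column's sort order.
    """
    # Parse the cells that can be interpreted as numbers.
    numeric = {}
    for i, cell in enumerate(column):
        try:
            numeric[i] = float(cell)
        except (TypeError, ValueError):
            pass

    # Sort numeric cell indices by value and assign 1-based ranks.
    order = sorted(numeric, key=numeric.get)
    ranks = [0] * len(column)
    for rank, idx in enumerate(order, start=1):
        ranks[idx] = rank
    return ranks

# Example: a hypothetical "points" column from a table.
print(column_ranks(["12", "7", "n/a", "30"]))  # [2, 1, 0, 3]
```

With ranks like these attached to each cell token, statements that require sorting or comparison (e.g. "team X scored the most points") can be verified without the model having to compare raw magnitudes directly.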