Can NLI Models Verify QA Systems’ Predictions?

Jifan Chen, Greg Durrett, Eunsol Choi

doi:10.18653/v1/2021.findings-emnlp.324

Abstract

To build robust question answering systems, we need the ability to verify whether answers to questions are truly correct, not just "good enough" in the context of imperfect QA datasets. We explore the use of natural language inference (NLI) as a way to achieve this goal, as NLI inherently requires the premise (document context) to contain all necessary information to support the hypothesis (proposed answer to the question). We leverage large pre-trained models and recent prior datasets to construct powerful question converter and decontextualization modules, which can reformulate QA instances as premise-hypothesis pairs with very high reliability. Then, by combining standard NLI datasets with NLI examples automatically derived from QA training data, we can train NLI models to judge the correctness of QA models' proposed answers. We show that our NLI approach can generally improve the confidence estimation of a QA model across different domains, evaluated in a selective QA setting. Careful manual analysis over the predictions of our NLI model shows that it can further identify cases where the QA model produces the right answer for the wrong reason, or where the answer cannot be verified as addressing all aspects of the question.
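
The selective QA setting mentioned above can be illustrated with a minimal sketch: a system answers only the questions on which its confidence is highest, and accuracy is measured on that answered subset. The function name `accuracy_at_coverage` and the toy scores below are assumptions made for illustration; they are not the authors' code or results. In the paper's setup the confidence would come from an NLI model's judgment of the premise-hypothesis pair, which is only stubbed here as a list of numbers.

```python
from typing import List


def accuracy_at_coverage(scores: List[float], correct: List[bool], coverage: float) -> float:
    """Accuracy on the `coverage` fraction of examples with the highest confidence.

    scores:   hypothetical confidence per QA prediction (e.g. an entailment probability)
    correct:  whether each QA prediction was actually right
    coverage: fraction of questions the system chooses to answer, in (0, 1]
    """
    if len(scores) != len(correct) or not scores:
        raise ValueError("scores and correct must be non-empty and the same length")
    if not 0 < coverage <= 1:
        raise ValueError("coverage must be in (0, 1]")

    # Rank predictions from most to least confident.
    ranked = sorted(zip(scores, correct), key=lambda pair: pair[0], reverse=True)

    # Answer only the top-k predictions; always answer at least one.
    k = max(1, int(coverage * len(ranked)))
    answered = ranked[:k]

    return sum(1 for _, is_correct in answered if is_correct) / k


# Toy, made-up numbers: four QA predictions with confidence scores.
toy_scores = [0.9, 0.8, 0.4, 0.2]
toy_correct = [True, False, True, False]

for cov in (0.25, 0.5, 1.0):
    print(cov, accuracy_at_coverage(toy_scores, toy_correct, cov))
```

A better confidence estimator ranks correct answers above incorrect ones, so accuracy at low coverage rises. That is the sense in which a verifier can improve confidence estimation without changing the QA model's answers.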

Publication Date: Jan 1, 2021
Citations: 9
License type: cc-by
