Exploring Contrast Consistency of Open-Domain Question Answering Systems on Minimally Edited Questions

Zhihan Zhang, Meng Jiang, Mingxuan Ju, Zheng Ning, Wenhao Yu

doi:10.1162/tacl_a_00591

Abstract

Contrast consistency, the ability of a model to make consistently correct predictions in the presence of perturbations, is an essential aspect in NLP. While studied in tasks such as sentiment analysis and reading comprehension, it remains unexplored in open-domain question answering (OpenQA) due to the difficulty of collecting perturbed questions that satisfy factuality requirements. In this work, we collect minimally edited questions as challenging contrast sets to evaluate OpenQA models. Our collection approach combines both human annotation and large language model generation. We find that the widely used dense passage retriever (DPR) performs poorly on our contrast sets, despite fitting the training set well and performing competitively on standard test sets. To address this issue, we introduce a simple and effective query-side contrastive loss with the aid of data augmentation to improve DPR training. Our experiments on the contrast sets demonstrate that DPR’s contrast consistency is improved without sacrificing its accuracy on the standard test sets.
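As an illustration of the notion of contrast consistency described in the abstract, the following minimal sketch shows one possible way to score it: a pair counts as consistent only when both the original question and its minimally edited counterpart are answered correctly. The function names, the pairing format, and the toy results are assumptions made for this example; the abstract does not specify the exact metric used in the paper.

```python
from typing import List, Tuple


def accuracy(flags: List[bool]) -> float:
    """Fraction of questions answered correctly."""
    if not flags:
        raise ValueError("need at least one prediction")
    return sum(flags) / len(flags)


def contrast_consistency(pairs: List[Tuple[bool, bool]]) -> float:
    """Fraction of (original, edited) pairs where BOTH are answered correctly.

    Assumed definition for illustration only; each tuple holds
    (original_correct, edited_correct) for one question pair.
    """
    if not pairs:
        raise ValueError("need at least one pair")
    both_correct = sum(1 for orig_ok, edit_ok in pairs if orig_ok and edit_ok)
    return both_correct / len(pairs)


# Hypothetical per-pair correctness flags, not data from the paper.
results = [(True, True), (True, False), (False, False), (True, True)]

original_acc = accuracy([orig_ok for orig_ok, _ in results])
contrast_acc = accuracy([edit_ok for _, edit_ok in results])
consistency = contrast_consistency(results)

print(f"original accuracy:    {original_acc:.2f}")
print(f"contrast accuracy:    {contrast_acc:.2f}")
print(f"contrast consistency: {consistency:.2f}")
```

Under this assumed definition, a model can score well on the original questions while its consistency stays low, which is the kind of gap the abstract reports for DPR.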

Journal: Transactions of the Association for Computational Linguistics
Publication Date: Aug 15, 2023
License type: CC BY 4.0
