A question-entailment approach to question answering

Asma Ben Abacha,Dina Demner-Fushman

doi:10.1186/s12859-019-3119-4

Asma Ben Abacha, Dina Demner-Fushman

Open Access

https://doi.org/10.1186/s12859-019-3119-4

Copy DOI

Abstract

BackgroundOne of the challenges in large-scale information retrieval (IR) is developing fine-grained and domain-specific methods to answer natural language questions. Despite the availability of numerous sources and datasets for answer retrieval, Question Answering (QA) remains a challenging problem due to the difficulty of the question understanding and answer extraction tasks. One of the promising tracks investigated in QA is mapping new questions to formerly answered questions that are “similar”.ResultsWe propose a novel QA approach based on Recognizing Question Entailment (RQE) and we describe the QA system and resources that we built and evaluated on real medical questions. First, we compare logistic regression and deep learning methods for RQE using different kinds of datasets including textual inference, question similarity, and entailment in both the open and clinical domains. Second, we combine IR models with the best RQE method to select entailed questions and rank the retrieved answers. To study the end-to-end QA approach, we built the MedQuAD collection of 47,457 question-answer pairs from trusted medical sources which we introduce and share in the scope of this paper. Following the evaluation process used in TREC 2017 LiveQA, we find that our approach exceeds the best results of the medical task with a 29.8% increase over the best official score.ConclusionsThe evaluation results support the relevance of question entailment for QA and highlight the effectiveness of combining IR and RQE for future QA efforts. Our findings also show that relying on a restricted set of reliable answer sources can bring a substantial improvement in medical QA.

Highlights

One of the challenges in large-scale information retrieval (IR) is developing fine-grained and domain-specific methods to answer natural language questions
Datasets Used for the Recognizing Question Entailment (RQE) Study Training Datasets We evaluate the RQE methods using two datasets of sentence pairs (SNLI and multiNLI), and three datasets of question pairs (Quora, Clinical-QE, and SemEval-cQA)
We evaluated the answers returned by the IR-based method and the hybrid Question Answering (QA) method (IR+RQE) according to the same reference answers used in LiveQA-Med

Summary

Introduction

One of the challenges in large-scale information retrieval (IR) is developing fine-grained and domain-specific methods to answer natural language questions. Despite the availability of numerous sources and datasets for answer retrieval, Question Answering (QA) remains a challenging problem due to the difficulty of the question understanding and answer extraction tasks. With the availability of rich data on users’ locations, profiles, and search histories, personalization has become the leading trend in large-scale information retrieval. Efficiency through personalization is not yet the most suitable model when tackling domain-specific searches. This is due to several factors, such as the lexical and semantic challenges of domain-specific data that often include advanced argumentation and complex contextual information, the higher sparseness of relevant information sources, and the more pronounced lack of similarities between users’ searches. With the abundance of information sources in the medical domain, consumers are increasingly faced with a similar challenge, one that needs dedicated solutions that can adapt to the heterogeneity and specifics of health-related information

Objectives

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Bioinformatics	Publication Date: Oct 22, 2019
Citations: 142	License type: open-access

R Discovery Prime

R Discovery Prime

A question-entailment approach to question answering

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

Biomedical question answering: A survey
Sofia J Athenikos ... Hyoil Han
Computer Methods and Programs in Biomedicine | VOL. 99
Sofia J Athenikos, et. al.Sofia J Athenikos ... Hyoil Han
13 Nov 2009
Computer Methods and Programs in Biomedicine | VOL. 99

Epidemic Question Answering: question generation and entailment for Answer Nugget discovery.
Maxwell A Weinzierl ... Sanda M Harabagiu
Journal of the American Medical Informatics Association : JAMIA | VOL. 30
Maxwell A Weinzierl, et. al.Maxwell A Weinzierl ... Sanda M Harabagiu
17 Nov 2022
Journal of the American Medical Informatics Association : JAMIA | VOL. 30

ANU-CSIRO at MEDIQA 2019: Question Answering Using Deep Contextual Knowledge
Vincent Nguyen ... Zhenchang Xing
-
Vincent Nguyen, et. al.Vincent Nguyen ... Zhenchang Xing
01 Jan 2019
ANU-CSIRO at MEDIQA 2019: Question Answering Using Deep Contextual Knowledge
Vincent Nguyen ... Zhenchang Xing

Learning to select the correct answer in multi-stream question answering
Alberto Téllez-Valero ... Anselmo Peñas Padilla
Information Processing and Management | VOL. 47
Alberto Téllez-Valero, et. al.Alberto Téllez-Valero ... Anselmo Peñas Padilla
13 May 2010
Information Processing and Management | VOL. 47

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A question-entailment approach to question answering

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics