Abstract

Open-domain question answering (OpenQA) tasks have recently been attracting increasing attention from the natural language processing (NLP) community. In this work, we present the first free-form multiple-choice OpenQA dataset for solving medical problems, MedQA, collected from professional medical board exams. It covers three languages: English, simplified Chinese, and traditional Chinese, and contains 12,723, 34,251, and 14,123 questions for the three languages, respectively. We implement both rule-based and popular neural methods by sequentially combining a document retriever and a machine comprehension model. Through experiments, we find that even the current best method can only achieve 36.7%, 42.0%, and 70.1% test accuracy on the English, traditional Chinese, and simplified Chinese questions, respectively. We expect MedQA to present great challenges to existing OpenQA systems and hope that it can serve as a platform to promote much stronger OpenQA models from the NLP community in the future.
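
The retriever-plus-reader pipeline mentioned in the abstract can be sketched minimally as follows. This is only an illustration, not the paper's implementation: the corpus, question, answer options, and the token-overlap reader are hypothetical placeholders, with a TF-IDF retriever standing in for the document-retrieval step and a trivial rule-based scorer standing in for the machine comprehension model.

```python
# Minimal sketch of a retriever + reader pipeline for multiple-choice OpenQA.
# All data below is illustrative; the reader is a rule-based stand-in that a
# neural machine comprehension model would replace in practice.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

corpus = [
    "Metformin is a first-line drug for type 2 diabetes mellitus.",
    "Amoxicillin is an antibiotic used to treat bacterial infections.",
    "Insulin therapy is used when oral agents fail to control glucose.",
]
question = "Which drug is typically used first for type 2 diabetes?"
options = ["Amoxicillin", "Metformin", "Insulin", "Ibuprofen"]

# Retriever: rank corpus documents by TF-IDF cosine similarity to the question.
vectorizer = TfidfVectorizer().fit(corpus + [question])
doc_vecs = vectorizer.transform(corpus)
q_vec = vectorizer.transform([question])
scores = cosine_similarity(q_vec, doc_vecs)[0]
top_doc = corpus[scores.argmax()]

# Reader (rule-based stand-in): score each option by its lexical overlap
# with the retrieved evidence and pick the highest-scoring option.
def option_score(option, evidence):
    return sum(tok in evidence.lower() for tok in option.lower().split())

predicted = max(options, key=lambda o: option_score(o, top_doc))
print("Retrieved evidence:", top_doc)
print("Predicted answer:", predicted)
```

In the paper's setup the reader step is where rule-based and neural methods differ; the overall structure of retrieving evidence first and then scoring each candidate answer is the same.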

Highlights

  • Question answering (QA) is a fundamental task in Natural Language Processing (NLP), which requires models to answer a given question.

  • Models are required to find and extract information relevant to questions from large-scale text sources such as a search engine [9] and Wikipedia [10]. This type of task is generally called open-domain question answering (OpenQA), which has recently attracted considerable attention from the natural language processing (NLP) community [11,12,13] but still remains far from solved.

  • Most previous work on OpenQA focuses on datasets in which answers are spans that can be found based on information explicitly expressed in the provided text [9,10,14,15].

Summary

Introduction

Question answering (QA) is a fundamental task in Natural Language Processing (NLP), which requires models to answer a given question. Real-world scenarios for QA are usually much more complex, and one may not have a body of text already labeled as containing the answer to the question. In this setting, models are required to find and extract information relevant to the question from large-scale text sources such as a search engine [9] and Wikipedia [10]. As a more challenging task, free-form multiple-choice OpenQA datasets such as ARC [16] and OpenBookQA [17] contain a significant percentage of questions focusing on facts, events, opinions, or emotions expressed only implicitly in the retrieved text. To answer these questions, models need to perform logical reasoning over the information presented in the retrieved text and, in some cases, even need to integrate prior knowledge. These OpenQA datasets consist of questions that require only elementary- or middle-school-level knowledge (e.g., "Which object would let the most heat travel through?"), so even excellent models trained on them may be unable to support more sophisticated real-world scenarios.
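
To make the free-form multiple-choice format concrete, a single instance of this kind might be represented as below. The field names and the answer options are hypothetical placeholders, not the actual schema or content of ARC, OpenBookQA, or MedQA.

```python
# Hypothetical representation of one free-form multiple-choice OpenQA instance
# (illustrative only; not the actual schema of any of the cited datasets).
example = {
    "question": "Which object would let the most heat travel through?",
    "options": {
        "A": "a wooden spoon",
        "B": "a metal rod",
        "C": "a plastic fork",
        "D": "a rubber band",
    },
    "answer": "B",  # gold label; a system must select exactly one option
}

# Unlike span-extraction QA, the answer is not a substring of any provided
# passage: a system must retrieve evidence and score each option against it.
for label, text in example["options"].items():
    print(label, text)
```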
