Abstract

Radiology images are an essential part of clinical decision making and population screening, e.g., for cancer. Automated systems could help clinicians cope with large amounts of images by answering questions about the image contents. An emerging area of artificial intelligence, Visual Question Answering (VQA) in the medical domain explores approaches to this form of clinical decision support. Success of such machine learning tools hinges on availability and design of collections composed of medical images augmented with question-answer pairs directed at the content of the image. We introduce VQA-RAD, the first manually constructed dataset where clinicians asked naturally occurring questions about radiology images and provided reference answers. Manual categorization of images and questions provides insight into clinically relevant tasks and the natural language to phrase them. Evaluating with well-known algorithms, we demonstrate the rich quality of this dataset over other automatically constructed ones. We propose VQA-RAD to encourage the community to design VQA tools with the goals of improving patient care.

Highlights

  • Visual question answering (VQA) is a computer vision and artificial intelligence (AI) problem that aims to answer questions about images

  • Many different techniques are applied to build Visual Question Answering (VQA) systems, including computer vision, natural language processing, and deep learning. These systems need to be trained for the task and evaluated on large data collections consisting of images and question-answer pairs directed at the content of those images

  • Although there has been great progress in image recognition in radiology[1], the datasets that enabled this progress do not generalize well to VQA because none of them include question-answer pairs directed at the images[2,3]

Background & Summary

Visual question answering (VQA) is a computer vision and artificial intelligence (AI) problem that aims to answer questions about images. Many different techniques are applied to build VQA systems, including computer vision, natural language processing, and deep learning. These systems need to be trained for the task and evaluated on large data collections consisting of images and question-answer pairs directed at the content of those images. To overcome the lack of readily available natural visual questions, earlier collections generated questions and answers automatically from the corresponding image captions. This resulted in many artificial questions that do not always make sense, to the point where a human could not work out what the question was trying to ask. Another issue with such a dataset is that its images were automatically extracted from PubMed Central articles. We demonstrate the value of VQA-RAD and its use cases by applying several well-known algorithms.
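As a rough illustration of the record structure such collections imply (a radiology image paired with a clinician's question and a reference answer), the sketch below loads and summarizes records of this shape. It is a minimal sketch only: the file name and field names ("image_name", "question", "answer", "answer_type") are assumptions for illustration, not the dataset's documented schema.

    # Minimal sketch: inspecting image/question/answer records of the kind
    # described above. Field names and file name are illustrative assumptions.
    import json
    from collections import Counter
    from dataclasses import dataclass

    @dataclass
    class VQARecord:
        image_name: str   # radiology image the question refers to
        question: str     # naturally phrased clinical question
        answer: str       # clinician-provided reference answer
        answer_type: str  # e.g. "CLOSED" (yes/no, limited choices) or "OPEN"

    def load_records(path):
        """Load image/question/answer records from a JSON list of dictionaries."""
        with open(path, encoding="utf-8") as f:
            raw = json.load(f)
        return [
            VQARecord(
                image_name=item["image_name"],
                question=item["question"],
                answer=item["answer"],
                answer_type=item.get("answer_type", "OPEN"),
            )
            for item in raw
        ]

    if __name__ == "__main__":
        records = load_records("vqa_rad.json")  # hypothetical file name
        print(f"{len(records)} question-answer pairs")
        # The answer-type distribution helps decide between classification over
        # a fixed answer set (closed questions) and free-text generation (open).
        print(Counter(r.answer_type for r in records))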

