A dataset for medical instructional video classification and question answering

Deepak Gupta, Dina Demner-Fushman, Kush Attal

doi:10.1038/s41597-023-02036-y

Open Access

https://doi.org/10.1038/s41597-023-02036-y

Abstract

This paper introduces a new challenge and datasets to foster research toward designing systems that can understand medical videos and provide visual answers to natural language questions. We believe medical videos may provide the best possible answers to many first aid, medical emergency, and medical education questions. To this end, we created the MedVidCL and MedVidQA datasets and introduce the tasks of Medical Video Classification (MVC) and Medical Visual Answer Localization (MVAL), both of which focus on cross-modal (medical language and medical video) understanding. The proposed tasks and datasets have the potential to support the development of sophisticated downstream applications that can benefit the public and medical practitioners. Our datasets consist of 6,117 fine-grained annotated videos for the MVC task and 3,010 questions with answer timestamps from 899 videos for the MVAL task. These datasets have been verified and corrected by medical informatics experts. We have also benchmarked each task with the created MedVidCL and MedVidQA datasets and propose multimodal learning methods that set competitive baselines for future research.

Journal: Scientific Data	Publication Date: Mar 22, 2023
Citations: 13	License type: open-access

Similar Papers

Overview of the MedVidQA 2022 Shared Task on Medical Video Question-Answering
12 May 2022

ANU-CSIRO at MEDIQA 2019: Question Answering Using Deep Contextual Knowledge
Vincent Nguyen ... Zhenchang Xing
01 Jan 2019

Overview of the NLPCC 2023 Shared Task: Chinese Medical Instructional Video Question Answering
Bin Li ... Yuwei Han
01 Jan 2023

Overview of the NLPCC 2024 Shared Task 7: Multi-lingual Medical Instructional Video Question Answering
Bin Li ... Shoujun Zhou
01 Nov 2024
