Improving chest X-ray report generation by leveraging warm starting

Aaron Nicolson,Jason Dowling,Bevan Koopman

doi:10.1016/j.artmed.2023.102633

Abstract

Automatically generating a report from a patient’s Chest X-rays (CXRs) is a promising solution to reducing clinical workload and improving patient care. However, current CXR report generators—which are predominantly encoder-to-decoder models—lack the diagnostic accuracy to be deployed in a clinical setting. To improve CXR report generation, we investigate warm starting the encoder and decoder with recent open-source computer vision and natural language processing checkpoints, such as the Vision Transformer (ViT) and PubMedBERT. To this end, each checkpoint is evaluated on the MIMIC-CXR and IU X-ray datasets. Our experimental investigation demonstrates that the Convolutional vision Transformer (CvT) ImageNet-21K and the Distilled Generative Pre-trained Transformer 2 (DistilGPT2) checkpoints are best for warm starting the encoder and decoder, respectively. Compared to the state-of-the-art (M2 Transformer Progressive), CvT2DistilGPT2 attained an improvement of 8.3% for CE F-1, 1.8% for BLEU-4, 1.6% for ROUGE-L, and 1.0% for METEOR. The reports generated by CvT2DistilGPT2 have a higher similarity to radiologist reports than previous approaches. This indicates that leveraging warm starting improves CXR report generation. Code and checkpoints for CvT2DistilGPT2 are available at https://github.com/aehrc/cvt2distilgpt2.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Artificial Intelligence in Medicine	Publication Date: Aug 19, 2023
Citations: 21	License type: cc-by

R Discovery Prime

R Discovery Prime

Improving chest X-ray report generation by leveraging warm starting

Abstract

Talk to us

Similar Papers

More From: Artificial Intelligence in Medicine

Lead the way for us

Similar Papers

Computer Vision and Natural Language Processing
Peratham Wiriyathammabhum ... Yiannis Aloimonos
ACM Computing Surveys | VOL. 49
Peratham Wiriyathammabhum, et. al.Peratham Wiriyathammabhum ... Yiannis Aloimonos
12 Dec 2016
ACM Computing Surveys | VOL. 49

Using Project-Based Learning to Teach Advanced Practice Nurses About Quality Improvement
Jaime Mcdermott
AACN Advanced Critical Care | VOL. 33
Jaime McdermottJaime Mcdermott
15 Dec 2022
AACN Advanced Critical Care | VOL. 33

VQAR: Review on Information Retrieval Techniques based on Computer Vision and Natural Language Processing
Shivangi Modi ... Dhatri Pandya
-
Shivangi Modi, et. al.Shivangi Modi ... Dhatri Pandya
01 Mar 2019
01 Mar 2019

Error Analysis for Visual Question Answering
Artur Podtikhov ... Alexey K Kovalev
-
Artur Podtikhov, et. al.Artur Podtikhov ... Alexey K Kovalev
02 Oct 2020
02 Oct 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improving chest X-ray report generation by leveraging warm starting

Abstract

Talk to us

Similar Papers

More From: Artificial Intelligence in Medicine