Bootstrapping Large Language Models for Radiology Report Generation

Chang Liu,Yuanhe Tian,Yan Song,Weidong Chen,Yongdong Zhang

doi:10.1609/aaai.v38i17.29826

Abstract

Radiology report generation (RRG) aims to automatically generate a free-text description from a specific clinical radiograph, e.g., chest X-Ray images. Existing approaches tend to perform RRG with specific models trained on the public yet limited data from scratch, where they often lead to inferior performance owing to the problem of inefficient capabilities in both aligning visual and textual features and generating informative reports accordingly. Currently, large language models (LLMs) offered a promising solution to text generation with their power in learning from big data, especially for cross-modal scenarios such as RRG. However, most existing LLMs are pre-trained on general data, and suffer from the same problem of conventional approaches caused by knowledge gap between general and medical domain if they are applied to RRG. Therefore in this paper, we propose an approach to bootstrapping LLMs for RRG with a in-domain instance induction and a coarse-to-fine decoding process. Specifically, the in-domain instance induction process learns to align the LLM to radiology reports from general texts through contrastive learning. The coarse-to-fine decoding performs a text elevating process for those reports from the ranker, further enhanced with visual features and refinement prompts. Experimental results on two prevailing RRG datasets, namely, IU X-Ray and MIMIC-CXR, demonstrate the superiority of our approach to previous state-of-the-art solutions. Further analyses illustrate that, for the LLM, the induction process enables it to better align with the medical domain and the coarse-to-fine generation allows it to conduct more precise text generation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Bootstrapping Large Language Models for Radiology Report Generation

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Mar 24, 2024
Citations: 2

Similar Papers

S3-Net: A Self-Supervised Dual-Stream Network for Radiology Report Generation.
Renjie Pan ... Shaoguo Cui
IEEE Journal of Biomedical and Health Informatics | VOL. 28
Renjie Pan, et. al.Renjie Pan ... Shaoguo Cui
01 Mar 2024
IEEE Journal of Biomedical and Health Informatics | VOL. 28

How Can IJDS Authors, Reviewers, and Editors Use (and Misuse) Generative AI?
Galit Shmueli ... Bianca Maria Colosimo
INFORMS Journal on Data Science | VOL. 2
Galit Shmueli, et. al.Galit Shmueli ... Bianca Maria Colosimo
01 Apr 2023
INFORMS Journal on Data Science | VOL. 2

Radiology report generation using transformers conditioned with non-imaging data
Nurbanu Aksoy ... Nishant Ravikumar
-
Nurbanu Aksoy, et. al.Nurbanu Aksoy ... Nishant Ravikumar
10 Apr 2023
10 Apr 2023

Chest radiology report generation based on cross-modal multi-scale feature fusion
Yu Pan ... Qing-Song Huang
Journal of Radiation Research and Applied Sciences | VOL. 17
Yu Pan, et. al.Yu Pan ... Qing-Song Huang
13 Jan 2024
Journal of Radiation Research and Applied Sciences | VOL. 17

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Bootstrapping Large Language Models for Radiology Report Generation

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence