Automatic Radiology Reports Generation via Memory Alignment Network

Hongyu Shen, Zhaoxing Tian, Juncai Liu, Mingtao Pei

doi:10.1609/aaai.v38i5.28279

Abstract

Automatic generation of radiology reports can reduce the workload of doctors and improve the accuracy and reliability of medical diagnosis and treatment, and it has attracted wide attention in recent years. Cross-modal mapping between images and text is a key component of generating high-quality reports, but it is challenging because corresponding annotations are lacking. Despite its importance, previous studies have often overlooked this component or lacked adequate designs for it. In this paper, we propose a method that uses a memory alignment embedding to help the model align visual and textual features and generate a coherent and informative report. Specifically, we first obtain the memory alignment embedding by querying the memory matrix, where the query is derived from a combination of the visual features and their corresponding positional embeddings. The memory alignment embedding then guides the alignment between the visual and textual features during the generation process. Comparison experiments with other alignment methods show that the proposed alignment method is less costly and more effective. The proposed approach achieves better performance than state-of-the-art approaches on two public datasets, IU X-Ray and MIMIC-CXR, which further demonstrates the effectiveness of the proposed alignment method.
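The memory query step described in the abstract can be illustrated with a minimal sketch. The abstract does not state how the memory matrix is queried, so this example assumes scaled dot-product attention over memory slots. The function names, the sinusoidal positional embeddings, the dimensions, and the random inputs are all hypothetical choices for illustration and are not taken from the paper.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def sinusoidal_positions(num_patches, dim):
    # Assumption: fixed sinusoidal positional embeddings, one row per patch.
    pos = np.arange(num_patches)[:, None]
    idx = np.arange(dim)[None, :]
    angles = pos / np.power(10000.0, (2 * (idx // 2)) / dim)
    return np.where(idx % 2 == 0, np.sin(angles), np.cos(angles))

def memory_alignment_embedding(visual, positions, memory):
    # Query = visual features combined with their positional embeddings.
    query = visual + positions                               # (N, d)
    # Assumption: scaled dot-product attention over the memory slots.
    scores = query @ memory.T / np.sqrt(memory.shape[1])     # (N, M)
    weights = softmax(scores, axis=-1)                       # rows sum to 1
    # Each patch receives a weighted mixture of memory slots.
    return weights @ memory, weights                         # (N, d), (N, M)

# Hypothetical sizes: 49 image patches, 64-dim features, 16 memory slots.
rng = np.random.default_rng(0)
num_patches, dim, num_slots = 49, 64, 16
visual = rng.normal(size=(num_patches, dim))   # stand-in for extracted visual features
memory = rng.normal(size=(num_slots, dim))     # stand-in for a learned memory matrix
positions = sinusoidal_positions(num_patches, dim)

alignment, weights = memory_alignment_embedding(visual, positions, memory)
```

In a trained model the memory matrix would be a learned parameter and the resulting embedding would be passed to the report decoder; how the paper injects it during generation is not specified in the abstract, so that step is left out here.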

More From: Proceedings of the AAAI Conference on Artificial Intelligence

