Hierarchical medical image report adversarial generation with hybrid discriminator

Junsan Zhang,Ming Cheng,Qiaoqiao Cheng,Xiuxuan Shen,Yao Wan,Jie Zhu,Mengxuan Liu

doi:10.1016/j.artmed.2024.102846

Abstract

Background and objectivesGenerating coherent reports from medical images is an important task for reducing doctors' workload. Unlike traditional image captioning tasks, the task of medical image report generation faces more challenges. Current models for generating reports from medical images often fail to characterize some abnormal findings, and some models generate reports with low quality. In this study, we propose a model to generate high-quality reports from medical images. MethodsIn this paper, we propose a model called Hybrid Discriminator Generative Adversarial Network (HDGAN), which combines Generative Adversarial Network (GAN) with Reinforcement Learning (RL). The HDGAN model consists of a generator, a one-sentence discriminator, and a one-word discriminator. Specifically, the RL reward signals are judged on the one-sentence discriminator and one-word discriminator separately. The one-sentence discriminator can better learn sentence-level structural information, while the one-word discriminator can learn word diversity information effectively. ResultsOur approach performs better on the IU-X-ray and COV-CTR datasets than the baseline models. For the ROUGE metric, our method outperforms the state-of-the-art model by 0.36 on the IU-X-ray, 0.06 on the MIMIC-CXR and 0.156 on the COV-CTR. ConclusionsThe compositional framework we proposed can generate more accurate medical image reports at different levels.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Hierarchical medical image report adversarial generation with hybrid discriminator

Abstract

Talk to us

Similar Papers

More From: Artificial intelligence in medicine

Lead the way for us

Similar Papers

Research on automatic generation of multimodal medical image reports based on memory driven
Junze Fang ... Zihan Ju
Sheng wu yi xue gong cheng xue za zhi = Journal of biomedical engineering = Shengwu yixue gongchengxue zazhi | VOL. 41
Junze Fang, et. al.Junze Fang ... Zihan Ju
25 Feb 2024
Sheng wu yi xue gong cheng xue za zhi = Journal of biomedical engineering = Shengwu yixue gongchengxue zazhi | VOL. 41

Reinforced Transformer for Medical Image Captioning
Yuxuan Xiong ... Pingkun Yan
-
Yuxuan Xiong, et. al.Yuxuan Xiong ... Pingkun Yan
01 Jan 2019
01 Jan 2019

A label information fused medical image report generation framework
Shuifa Sun ... Yirong Wu
Artificial Intelligence In Medicine | VOL. 150
Shuifa Sun, et. al.Shuifa Sun ... Yirong Wu
22 Feb 2024
Artificial Intelligence In Medicine | VOL. 150

ICIPEMIR: Improving the Completeness, Interoperability and Patient Explanations of Medical Imaging Reports.
Arthur Lauriot Dit Prevost ... Guillaume Bouzille
Studies in health technology and informatics | VOL. 281
Arthur Lauriot Dit Prevost, et. al.Arthur Lauriot Dit Prevost ... Guillaume Bouzille
27 May 2021
Studies in health technology and informatics | VOL. 281

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Hierarchical medical image report adversarial generation with hybrid discriminator

Abstract

Talk to us

Similar Papers

More From: Artificial intelligence in medicine