Significantly improving zero-shot X-ray pathology classification via fine-tuning pre-trained image-text encoders

Jongseong Jang, Daeun Kyung, Seung Hwan Kim, Honglak Lee, Kyunghoon Bae, Edward Choi

doi:10.1038/s41598-024-73695-z

Abstract

Deep neural networks are increasingly used in medical imaging for tasks such as pathology classification, but they face challenges due to the scarcity of high-quality, expert-labeled training data. Recent efforts have adapted pre-trained contrastive image-text models such as CLIP for medical use by fine-tuning them on chest X-ray images and their corresponding reports, enabling zero-shot pathology classification and thus eliminating the need for pathology-specific annotations. However, most studies continue to use the same contrastive learning objectives as in the general domain, overlooking the multi-label nature of medical image-report pairs. In this paper, we propose a new fine-tuning strategy that combines positive-pair loss relaxation and random sentence sampling, with the aim of improving zero-shot pathology classification without relying on external knowledge. Because it does not use external data, our method can be applied to any pre-trained contrastive image-text encoder and easily transferred to out-of-domain datasets without further training. Our approach consistently improves overall zero-shot pathology classification across four chest X-ray datasets and three pre-trained models, with an average macro AUROC increase of 4.3%. Additionally, our method outperforms the state-of-the-art and marginally surpasses board-certified radiologists in zero-shot classification for the five competition pathologies in the CheXpert dataset.
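
The headline metric above is macro AUROC, which is the unweighted mean of the per-pathology AUROC values. The following is a minimal sketch of that calculation on toy data, not the authors' evaluation code; the function names (`auroc`, `macro_auroc`) and the label and score matrices are hypothetical and chosen only for illustration.

```python
from typing import Sequence


def auroc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUROC as the probability that a random positive example
    scores higher than a random negative one (ties count as 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one positive and one negative")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


def macro_auroc(
    label_matrix: Sequence[Sequence[int]],
    score_matrix: Sequence[Sequence[float]],
) -> float:
    """Unweighted mean of per-class AUROCs.
    Rows are images, columns are pathologies (multi-label)."""
    n_classes = len(label_matrix[0])
    per_class = [
        auroc(
            [row[c] for row in label_matrix],
            [row[c] for row in score_matrix],
        )
        for c in range(n_classes)
    ]
    return sum(per_class) / n_classes


# Toy data (assumed, not from the paper): 4 images, 2 pathologies.
toy_labels = [[1, 0], [0, 1], [1, 1], [0, 0]]
toy_scores = [[0.9, 0.2], [0.3, 0.8], [0.6, 0.4], [0.4, 0.5]]
toy_result = macro_auroc(toy_labels, toy_scores)
```

Because every pathology contributes equally to the mean regardless of how often it occurs, a rare finding weighs as much as a common one in this metric.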

Journal: Scientific Reports
Publication Date: Oct 5, 2024
License type: CC BY-NC-ND 4.0
