Abstract

Most advances in medical image recognition for clinical auxiliary diagnosis are hindered by the low-resource nature of the medical domain, where annotations are expensive and require professional expertise. This low-resource problem can be alleviated by leveraging the transferable representations of large-scale pre-trained vision-language models such as CLIP. After pre-training on large-scale unlabeled medical images and texts (such as medical reports), vision-language models learn transferable representations and support flexible downstream clinical tasks, such as medical image classification via relevant medical text prompts. However, when applied to a specific medical imaging task, existing pre-trained vision-language models require domain experts (clinicians) to carefully design dataset-specific text prompts, which is extremely time-consuming and greatly increases the burden on clinicians. To address this problem, we propose MedPrompt, a weakly supervised prompt learning method that automatically generates medical prompts and comprises an unsupervised pre-trained vision-language model and a weakly supervised prompt learning model. The unsupervised pre-trained vision-language model is trained on large-scale medical images and texts, exploiting the natural correlation between medical images and their corresponding medical texts without manual annotation. The weakly supervised prompt learning model uses only the image class labels in the dataset to guide the learning of the class-specific vector in the prompt; the remaining context vectors in the prompt are learned without any manual annotation. To the best of our knowledge, this is the first model to automatically generate medical prompts. With these prompts, the pre-trained vision-language model is freed from the strong expert dependency of manual annotation and manual prompt design, enabling end-to-end, low-cost medical image classification. Experimental results show that the model using our automatically generated prompts outperforms all of its hand-crafted prompt counterparts in full-shot learning on all four datasets, achieves superior accuracy in zero-shot image classification and few-shot learning on three of the four medical benchmark datasets, and achieves comparable accuracy on the remaining one. In addition, the proposed prompt generator is lightweight and can therefore potentially be embedded into any network architecture.
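To make the prompt-learning idea concrete, the following is a minimal PyTorch sketch of a CoOp-style formulation consistent with the description above: shared context vectors are learned without any annotation, while one class-specific vector per class is guided only by the image class labels (weak supervision). All names (PromptLearner, n_ctx, the mocked encoders) and hyperparameters are illustrative assumptions, not the paper's actual implementation.

```python
# Minimal sketch of weakly supervised prompt learning (not the paper's code).
# Assumption: a frozen CLIP-like model provides image/text encoders; they are
# mocked here so the sketch runs standalone. Dimensions are illustrative.
import torch
import torch.nn as nn
import torch.nn.functional as F

class PromptLearner(nn.Module):
    """Learns shared context vectors plus one class-specific vector per class.

    Only the class labels (weak supervision) drive the class vectors; the
    context vectors are learned from the classification loss alone.
    """

    def __init__(self, n_classes: int, n_ctx: int = 8, dim: int = 512):
        super().__init__()
        # Shared context "words", randomly initialized and learned end to end.
        self.ctx = nn.Parameter(torch.randn(n_ctx, dim) * 0.02)
        # One learnable class-specific vector per class, guided by labels.
        self.cls = nn.Parameter(torch.randn(n_classes, dim) * 0.02)

    def forward(self) -> torch.Tensor:
        n_classes = self.cls.shape[0]
        # Prompt per class: [ctx_1 ... ctx_n, class_vec] -> (C, n_ctx+1, dim)
        ctx = self.ctx.unsqueeze(0).expand(n_classes, -1, -1)
        return torch.cat([ctx, self.cls.unsqueeze(1)], dim=1)

def classification_logits(image_feats, prompts, text_encoder, temperature=0.07):
    """Score each image against each class prompt, CLIP style."""
    text_feats = text_encoder(prompts)                 # (C, dim)
    image_feats = F.normalize(image_feats, dim=-1)
    text_feats = F.normalize(text_feats, dim=-1)
    return image_feats @ text_feats.t() / temperature  # (B, C)

if __name__ == "__main__":
    # Stand-ins for the frozen pre-trained vision-language model.
    dim, n_classes, batch = 512, 4, 8
    text_encoder = lambda p: p.mean(dim=1)             # (C, L, dim) -> (C, dim)
    image_feats = torch.randn(batch, dim)              # frozen image encoder output
    labels = torch.randint(0, n_classes, (batch,))     # weak supervision: labels only

    learner = PromptLearner(n_classes, n_ctx=8, dim=dim)
    optim = torch.optim.Adam(learner.parameters(), lr=1e-3)

    logits = classification_logits(image_feats, learner(), text_encoder)
    loss = F.cross_entropy(logits, labels)             # labels guide class vectors
    loss.backward()                                    # gradients reach only the prompt
    optim.step()
```

In such a setup the pre-trained encoders stay frozen, so the only trainable parameters are the prompt vectors themselves, which is why a prompt generator of this kind remains lightweight enough to attach to an arbitrary backbone.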
