Uni4Eye++: A General Masked Image Modeling Multi-modal Pre-training Framework for Ophthalmic Image Classification and Segmentation.

Zhiyuan Cai,Li Lin,Xiaoying Tang,Pujin Cheng,Huaqing He

doi:10.1109/tmi.2024.3422102

Abstract

A large-scale labeled dataset is a key factor for the success of supervised deep learning in most ophthalmic image analysis scenarios. However, limited annotated data is very common in ophthalmic image analysis, since manual annotation is time-consuming and labor-intensive. Self-supervised learning (SSL) methods bring huge opportunities for better utilizing unlabeled data, as they do not require massive annotations. To utilize as many unlabeled ophthalmic images as possible, it is necessary to break the dimension barrier, simultaneously making use of both 2D and 3D images as well as alleviating the issue of catastrophic forgetting. In this paper, we propose a universal self-supervised Transformer framework named Uni4Eye++ to discover the intrinsic image characteristic and capture domain-specific feature embedding in ophthalmic images. Uni4Eye++ can serve as a global feature extractor, which builds its basis on a Masked Image Modeling task with a Vision Transformer architecture. On the basis of our previous work Uni4Eye, we further employ an image entropy guided masking strategy to reconstruct more-informative patches and a dynamic head generator module to alleviate modality confusion. We evaluate the performance of our pre-trained Uni4Eye++ encoder by fine-tuning it on multiple downstream ophthalmic image classification and segmentation tasks. The superiority of Uni4Eye++ is successfully established through comparisons to other state-of-the-art SSL pre-training methods. Our code is available at Github1.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Uni4Eye++: A General Masked Image Modeling Multi-modal Pre-training Framework for Ophthalmic Image Classification and Segmentation.

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on medical imaging

Lead the way for us

Similar Papers

Uni4Eye: Unified 2D and 3D Self-supervised Pre-training via Masked Image Modeling Transformer for Ophthalmic Image Classification
Zhiyuan Cai ... Li Lin
-
Zhiyuan Cai, et. al.Zhiyuan Cai ... Li Lin
01 Jan 2021
01 Jan 2021

Reducing annotation burden in MR: A novel MR-contrast guided contrastive learning approach for image segmentation.
Lavanya Umapathy ... J'Rick Lu
Medical physics | VOL. 51
Lavanya Umapathy, et. al.Lavanya Umapathy ... J'Rick Lu
13 Nov 2023
Medical physics | VOL. 51

Benchmarking Self-Supervised Contrastive Learning Methods for Image-Based Plant Phenotyping.
Franklin C Ogidi ... Ian Stavness
Plant phenomics (Washington, D.C.) | VOL. 5
Franklin C Ogidi, et. al.Franklin C Ogidi ... Ian Stavness
01 Jan 2023
Plant phenomics (Washington, D.C.) | VOL. 5

Self-Supervised Visual Feature Learning With Deep Neural Networks: A Survey.
Longlong Jing ... Yingli Tian
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 43
Longlong Jing, et. al.Longlong Jing ... Yingli Tian
04 May 2020
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 43

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Uni4Eye++: A General Masked Image Modeling Multi-modal Pre-training Framework for Ophthalmic Image Classification and Segmentation.

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on medical imaging