Learning For Medical Image Analysis Research Articles

Pre-training deep learning models with large data sets of natural images, such as ImageNet, has become the standard for endoscopic image analysis. This approach is generally superior to training from scratch, due to the scarcity of high-quality medical imagery and labels. However, it is still unknown whether the learned features on natural imagery provide an optimal starting point for the downstream medical endoscopic imaging tasks. Intuitively, pre-training with imagery closer to the target domain could lead to better-suited feature representations. This study evaluates whether leveraging in-domain pre-training in gastrointestinal endoscopic image analysis has potential benefits compared to pre-training on natural images.To this end, we present a dataset comprising of 5,014,174 gastrointestinal endoscopic images from eight different medical centers (GastroNet-5M), and exploit self-supervised learning with SimCLRv2, MoCov2 and DINO to learn relevant features for in-domain downstream tasks. The learned features are compared to features learned on natural images derived with multiple methods, and variable amounts of data and/or labels (e.g. Billion-scale semi-weakly supervised learning and supervised learning on ImageNet-21k). The effects of the evaluation is performed on five downstream data sets, particularly designed for a variety of gastrointestinal tasks, for example, GIANA for angiodyplsia detection and Kvasir-SEG for polyp segmentation.The findings indicate that self-supervised domain-specific pre-training, specifically using the DINO framework, results into better performing models compared to any supervised pre-training on natural images. On the ResNet50 and Vision-Transformer-small architectures, utilizing self-supervised in-domain pre-training with DINO leads to an average performance boost of 1.63% and 4.62%, respectively, on the downstream datasets. This improvement is measured against the best performance achieved through pre-training on natural images within any of the evaluated frameworks.Moreover, the in-domain pre-trained models also exhibit increased robustness against distortion perturbations (noise, contrast, blur, etc.), where the in-domain pre-trained ResNet50 and Vision-Transformer-small with DINO achieved on average 1.28% and 3.55% higher on the performance metrics, compared to the best performance found for pre-trained models on natural images.Overall, this study highlights the importance of in-domain pre-training for improving the generic nature, scalability and performance of deep learning for medical image analysis. The GastroNet-5M pre-trained weights are made publicly available in our repository: huggingface.co/tgwboers/GastroNet-5M_Pretrained_Weights.

Read full abstract

Background: Celiac disease arises from gluten consumption and shares symptoms with other conditions, leading to delayed diagnoses. Untreated celiac disease heightens the risk of autoimmune disorders, neurological issues, and certain cancers like lymphoma while also impacting skin health due to intestinal disruptions. This study uses facial photos to distinguish individuals with celiac disease from those without. Surprisingly, there is a lack of research involving transfer learning for this purpose despite its benefits such as faster training, enhanced performance, and reduced overfitting. While numerous studies exist on endoscopic intestinal photo classification and a few have explored the link between facial morphology measurements and celiac disease, none have concentrated on diagnosing celiac disease through facial photo classification. Methods: This study sought to apply transfer learning techniques with VGG16 to address a gap in research by identifying distinct facial features that differentiate patients with celiac disease from healthy individuals. A dataset containing a total of 200 facial images of adult individuals with and without celiac condition was utilized. Half of the dataset had a ratio of 70% females to 30% males with celiac condition, and the rest had a ratio of 60% females to 40% males without celiac condition. Among those with celiac condition, 28 were newly diagnosed and 72 had been previously diagnosed, with 25 not adhering to a gluten-free diet and 47 partially adhering to such a diet. Results: Utilizing transfer learning, the model achieved a 73% accuracy in classifying the facial images of the patients during testing, with corresponding precision, recall, and F1 score values of 0.54, 0.56, and 0.52, respectively. The training process involved 50,178 parameters, showcasing the model’s efficacy in diagnostic image analysis. Conclusions: The model correctly classified approximately three-quarters of the test images. While this is a reasonable level of accuracy, it also suggests that there is room for improvement as the dataset contains images that are inherently difficult to classify even for humans. Increasing the proportion of newly diagnosed patients in the dataset and expanding the dataset size could notably improve the model’s efficacy. Despite being the first study in this field, further refinement holds promise for the development of a diagnostic tool for celiac disease using transfer learning in medical image analysis, addressing the lack of prior studies in this area.

Read full abstract

Learning For Medical Image Analysis Research Articles

Articles published on Learning For Medical Image Analysis

A Comprehensive Review in Exploring the Role of Reinforcement Learning in Medical Image Analysis

Construction and Validation of a General Medical Image Dataset for Pretraining.

Foundation models in gastrointestinal endoscopic AI: Impact of architecture, pre-training approach and data efficiency

Multistage transfer learning for medical images

IGU-Aug: Information-guided unsupervised augmentation and pixel-wise contrastive learning for medical image analysis.

Innovative Approaches to Clinical Diagnosis: Transfer Learning in Facial Image Classification for Celiac Disease Identification

Enhancing Diagnostic: Machine Learning in Medical Image Analysis

Enhancing Diagnostic: Machine Learning in Medical Image Analysis

Deep Learning Approaches for Medical Image Analysis and Diagnosis.

A hybrid approach based on multipath Swin transformer and ConvMixer for white blood cells classification

Self-supervised learning for medical image analysis: a comprehensive review

Bio-Medical Image Segmentation And Detection For Brain Tumour And Skin Lesions Diseases Through U-NET

Self-supervised multi-task learning for medical image analysis

Self-supervised learning for medical image analysis: Discriminative, restorative, or adversarial?

Deep Machine Learning for Medical Diagnosis, Application to Lung Cancer Detection: A Review

A Review on Medical Image Applications Based on Deep Learning Techniques

Efficient 3D Representation Learning for Medical Image Analysis

Federated Learning in Medical Image Analysis: A Systematic Survey

Dive into the details of self-supervised learning for medical image analysis.

CReg-KD: Model refinement via confidence regularized knowledge distillation for brain imaging.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Learning For Medical Image Analysis Research Articles

Articles published on Learning For Medical Image Analysis

A Comprehensive Review in Exploring the Role of Reinforcement Learning in Medical Image Analysis

Construction and Validation of a General Medical Image Dataset for Pretraining.

Foundation models in gastrointestinal endoscopic AI: Impact of architecture, pre-training approach and data efficiency

Multistage transfer learning for medical images

IGU-Aug: Information-guided unsupervised augmentation and pixel-wise contrastive learning for medical image analysis.

Innovative Approaches to Clinical Diagnosis: Transfer Learning in Facial Image Classification for Celiac Disease Identification

Enhancing Diagnostic: Machine Learning in Medical Image Analysis

Enhancing Diagnostic: Machine Learning in Medical Image Analysis

Deep Learning Approaches for Medical Image Analysis and Diagnosis.

A hybrid approach based on multipath Swin transformer and ConvMixer for white blood cells classification

Self-supervised learning for medical image analysis: a comprehensive review

Bio-Medical Image Segmentation And Detection For Brain Tumour And Skin Lesions Diseases Through U-NET

Self-supervised multi-task learning for medical image analysis

Self-supervised learning for medical image analysis: Discriminative, restorative, or adversarial?

Deep Machine Learning for Medical Diagnosis, Application to Lung Cancer Detection: A Review

A Review on Medical Image Applications Based on Deep Learning Techniques

Efficient 3D Representation Learning for Medical Image Analysis

Federated Learning in Medical Image Analysis: A Systematic Survey

Dive into the details of self-supervised learning for medical image analysis.

CReg-KD: Model refinement via confidence regularized knowledge distillation for brain imaging.