Most classification techniques in machine learning can produce probability predictions in addition to class predictions. However, these predicted probabilities are often not well calibrated in that they deviate from the actual outcome rates (i.e., the proportion of data instances that actually belong to a certain class). A lack of calibration can jeopardize downstream decision tasks that rely on accurate probability predictions. Although several post hoc calibration methods have been proposed, they generally do not consider the potentially asymmetric costs associated with overprediction versus underprediction. In this research, we formally define the problem of cost-aware calibration and propose a metric to quantify the cost of miscalibration for a given classifier. We then propose three approaches to achieve cost-aware calibration: two are cost-aware adaptations of existing calibration algorithms, and the third (named MetaCal) is a Bayes-optimal learning algorithm inspired by prior work on cost-aware classification. We carry out systematic empirical evaluations on multiple public data sets to demonstrate the effectiveness of the proposed approaches in reducing the cost of miscalibration. Finally, we generalize the definition, metric, and solution algorithms of cost-aware calibration to account for nonlinear cost structures that may arise in real-world decision tasks.

Data Ethics & Reproducibility Note: There are no data ethics considerations. The code capsule is available on Code Ocean at https://doi.org/10.24433/CO.8552538.v1 and in the e-Companion to this article (available at https://doi.org/10.1287/ijds.2024.0038).