Small Dataset Size Research Articles

Background and objectivesBio-medical image segmentation models typically attempt to predict one segmentation that resembles a ground-truth structure as closely as possible. However, as medical images are not perfect representations of anatomy, obtaining this ground truth is not possible. A surrogate commonly used is to have multiple expert observers define the same structure for a dataset. When multiple observers define the same structure on the same image there can be significant differences depending on the structure, image quality/modality and the region being defined. It is often desirable to estimate this type of aleatoric uncertainty in a segmentation model to help understand the region in which the true structure is likely to be positioned. Furthermore, obtaining these datasets is resource intensive so training such models using limited data may be required. With a small dataset size, differing patient anatomy is likely not well represented causing epistemic uncertainty which should also be estimated so it can be determined for which cases the model is effective or not. MethodsWe use a 3D probabilistic U-Net to train a model from which several segmentations can be sampled to estimate the range of uncertainty seen between multiple observers. To ensure that regions where observers disagree most are emphasised in model training, we expand the Generalised Evidence Lower Bound (ELBO) with a Constrained Optimisation (GECO) loss function with an additional contour loss term to give attention to this region. Ensemble and Monte-Carlo dropout (MCDO) uncertainty quantification methods are used during inference to estimate model confidence on an unseen case. We apply our methodology to two radiotherapy clinical trial datasets, a gastric cancer trial (TOPGEAR, TROG 08.08) and a post-prostatectomy prostate cancer trial (RAVES, TROG 08.03). Each dataset contains only 10 cases each for model development to segment the clinical target volume (CTV) which was defined by multiple observers on each case. An additional 50 cases are available as a hold-out dataset for each trial which had only one observer define the CTV structure on each case. Up to 50 samples were generated using the probabilistic model for each case in the hold-out dataset. To assess performance, each manually defined structure was matched to the closest matching sampled segmentation based on commonly used metrics. ResultsThe TOPGEAR CTV model achieved a Dice Similarity Coefficient (DSC) and Surface DSC (sDSC) of 0.7 and 0.43 respectively with the RAVES model achieving 0.75 and 0.71 respectively. Segmentation quality across cases in the hold-out datasets was variable however both the ensemble and MCDO uncertainty estimation approaches were able to accurately estimate model confidence with a p-value < 0.001 for both TOPGEAR and RAVES when comparing the DSC using the Pearson correlation coefficient. ConclusionsWe demonstrated that training auto-segmentation models which can estimate aleatoric and epistemic uncertainty using limited datasets is possible. Having the model estimate prediction confidence is important to understand for which unseen cases a model is likely to be useful.

Abstract Checkpoint blockade immunotherapy is a cornerstone of lung cancer treatment, but there is a need to improve the identification of patients who will respond favorably. Here, we explored a deep learning approach to predict immunotherapy outcomes from hematoxylin and eosin (H&E) images in non-small cell lung cancer (NSCLC). We included 150 unique cases with metastatic NSCLC (113 adenocarcinoma, 29 squamous cell, 8 other) treated with anti-PD-1/PD-L1 immunotherapy (56 nivolumab, 49 atezolizumab, 44 pembrolizumab, 1 durvalumab) as mono or combination (14 with chemotherapy, 1 with ipilimumab) therapy in a single institution. Each case consisted of a representative H&E whole slide image (53 biopsies, 50 needle core biopsies, 47 resections) obtained prior to immunotherapy, and the outcome reported as the 1-year overall survival (OS). PD-L1 status (tumor proportion score ≥ 1%) was known for 70 cases. We preprocessed the H&E images using two deep learning models previously developed using The Cancer Genome Atlas dataset. First, we used a classification model to identify tumor regions and randomly sampled a fixed number of tumor patches for each case. Then, we used a self-supervised pathology foundation model to obtain a compressed visual representation of each patch, known as an embedding. Next, using our dataset, we trained a deep multiple instance learning (DeepMIL) model with a gated attention mechanism to predict the binary 1-year OS status (0=deceased, 1=alive) for each case. As a baseline, we also trained a linear-probe (logistic regression) model using the averaged embeddings. Given the small dataset size, 5-fold cross-validation was used to train and evaluate both the DeepMIL and linear-probe models, with cases randomly split across folds. For evaluation, we used survival analysis to compare the 0/1 case groups. Overall, across all 150 cases, univariable Cox regression showed that 1-year OS was more strongly associated with the DeepMIL status (46/104, HR=0.55, p=0.03) than the linear-probe status (55/95, HR=0.81, p=0.44). Results were consistent on the subset of 70 cases with known PD-L1 status, whereby OS was most strongly associated with the DeepMIL status (19/51, HR=0.40, p=0.04) compared to the linear-probe status (31/39, HR=0.46, p=0.09) and PD-L1 status (30/40, HR=0.65, p=0.32). In multivariable Cox regression adjusting for age group and smoking status, OS remained more strongly associated with the DeepMIL status (HR=0.45, p=0.08) than PD-L1 status (HR=0.72, p=0.47). In conclusion, the DeepMIL status predicted from H&E images showed a stronger association with outcomes compared to PD-L1 status, a standard biomarker for immunotherapy in NSCLC. These exploratory results demonstrate the potential of deep learning using pathology foundation models to improve immunotherapy outcomes prediction, even with small datasets. Such approaches may even enable the discovery of novel biomarkers from H&E images to advance precision medicine. Citation Format: Jessica Loo, Yang Wang, Pok Fai Wong, Ellery Wulczyn, Jeremy Lai, Peter Cimermancic, David F. Steiner, Shamira S. Weaver. Predicting immunotherapy outcomes from H&E images in lung cancer [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2024; Part 1 (Regular Abstracts); 2024 Apr 5-10; San Diego, CA. Philadelphia (PA): AACR; Cancer Res 2024;84(6_Suppl):Abstract nr 7380.

Small Dataset Size Research Articles

Related Topics

Articles published on Small Dataset Size

Transferable machine learning model for the aerodynamic prediction of swept wings

Patch-and-amplify Capsule Network for the recognition of gastrointestinal diseases

Harmonizing heterogeneous transcriptomics datasets for machine learning-based analysis to identify spaceflown murine liver-specific changes

DPML: Prior-guided multitask learning for dental object recognition on limited panoramic radiograph dataset

Uncertainty estimation using a 3D probabilistic U-Net for segmentation with small radiotherapy clinical trial datasets

An Effective Classification of Brain Tumor using Deep Learning Techniques

An effective ensemble learning approach for classification of glioma grades based on novel MRI features

Strategic Selection of Machine Learning Models for Short-term Trading Optimization

An approach to estimate the low cycle fatigue probabilistic curves of PBF-LB/M 316L steel from small size datasets using the remora optimization algorithm

Machine learning for determination of activity of water and activity coefficients of electrolytes in binary solutions

Enhancing performance of vision transformers on small datasets through local inductive bias incorporation

Sparse L0-norm least squares support vector machine with feature selection

Use of 3D-CAPSNET and RNN models for 4D fMRI-based Alzheimer’s Disease Pre-detection

Sentiment analysis on a low-resource language dataset using multimodal representation learning and cross-lingual transfer learning

Adaptive Stacking Ensemble Techniques for Early Severity Classification of COVID-19 Patients

Improving Transferability for Cross-Domain Trajectory Prediction via Neural Stochastic Differential Equation

Abstract 7380: Predicting immunotherapy outcomes from H&E images in lung cancer

A differentiable first-order rule learner for inductive logic programming

Evolutionary neuron-level transfer learning for QoT estimation in optical networks

Military Unmanned Equipment Image Target Recognition Method based on Improved Deep Learning

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Small Dataset Size Research Articles

Related Topics

Articles published on Small Dataset Size

Transferable machine learning model for the aerodynamic prediction of swept wings

Patch-and-amplify Capsule Network for the recognition of gastrointestinal diseases

Harmonizing heterogeneous transcriptomics datasets for machine learning-based analysis to identify spaceflown murine liver-specific changes

DPML: Prior-guided multitask learning for dental object recognition on limited panoramic radiograph dataset

Uncertainty estimation using a 3D probabilistic U-Net for segmentation with small radiotherapy clinical trial datasets

An Effective Classification of Brain Tumor using Deep Learning Techniques

An effective ensemble learning approach for classification of glioma grades based on novel MRI features

Strategic Selection of Machine Learning Models for Short-term Trading Optimization

An approach to estimate the low cycle fatigue probabilistic curves of PBF-LB/M 316L steel from small size datasets using the remora optimization algorithm

Machine learning for determination of activity of water and activity coefficients of electrolytes in binary solutions

Enhancing performance of vision transformers on small datasets through local inductive bias incorporation

Sparse L0-norm least squares support vector machine with feature selection

Use of 3D-CAPSNET and RNN models for 4D fMRI-based Alzheimer’s Disease Pre-detection

Sentiment analysis on a low-resource language dataset using multimodal representation learning and cross-lingual transfer learning

Adaptive Stacking Ensemble Techniques for Early Severity Classification of COVID-19 Patients

Improving Transferability for Cross-Domain Trajectory Prediction via Neural Stochastic Differential Equation

Abstract 7380: Predicting immunotherapy outcomes from H&amp;E images in lung cancer

A differentiable first-order rule learner for inductive logic programming

Evolutionary neuron-level transfer learning for QoT estimation in optical networks

Military Unmanned Equipment Image Target Recognition Method based on Improved Deep Learning

Abstract 7380: Predicting immunotherapy outcomes from H&E images in lung cancer