Ground Truth Labels Research Articles

PurposeSegmentations of retinal layers in spectral-domain optical coherence tomography (SD-OCT) images serve as a crucial tool for identifying and analyzing the progression of various retinal diseases, encompassing a broad spectrum of abnormalities associated with age-related macular degeneration (AMD). The training of deep-learning algorithms necessitates well-defined ground-truth labels, validated by experts, to delineate boundaries accurately. However, this resource-intensive process has constrained the widespread application of such algorithms across diverse OCT devices. This work validates deep learning image segmentation models across multiple OCT devices by testing robustness in generating clinically relevant metrics. DesignProspective, comparative study. ParticipantsAdults over 50 years of age with no AMD to advanced AMD, as defined in the Age-Related Eye Disease Study (AREDS)), in at least one eye, were enrolled. 402 SD-OCT scans were used in this study. MethodsWe evaluate two separate state-of-the-art segmentation algorithms through a training process using images obtained from one OCT device (Heidelberg-Spectralis) and subsequent testing using images acquired from two OCT devices (Heidelberg-Spectralis and Zeiss-Cirrus). This assessment is performed on a dataset that encompasses a range of retinal pathologies, spanning from disease-free conditions to severe forms of AMD, with a focus on evaluating the device independence of the algorithms. Main Outcome MeasuresPerformance metrics (mean-squared-error, mean-absolute-error, dice-coefficients) for the segmentations of the internal limiting membrane (ILM), retinal-pigment-epithelium (RPE), and RPE to Bruch’s-membrane (BM) region, along with en face thickness maps, volumetric estimations (in mm3). Violin plots and Bland-Altman plots comparing predictions against ground-truth are also presented. ResultsThe UNet and DeepLabv3, trained on Spectralis B-scans, demonstrate clinically useful outcomes when applied to Cirrus test B-scans. Review of the Cirrus test-data by two independent annotators revealed that the aggregated-mean-absolute-error in pixels for ILM was 1.82±0.24 (equivalent to 7.0±0.9 μm) and for RPE was 2.46±0.66 (9.5±2.6 μm). Additionally, the dice-similarity-coefficient for the RPE-drusen complex (RPE-DC) region, comparing predictions to ground truth, reached 0.87±0.01. ConclusionIn the pursuit of task-specific goals such as retinal layer segmentation, a segmentation network has the capacity to acquire domain-independent features from a large training dataset. This enables the utilization of the network to execute tasks in domains where ground truth is hard to generate.

Read full abstract

Large foundation models, such as the Segment Anything Model (SAM), have shown remarkable performance in image segmentation tasks. However, the optimal approach to achieve true utility of these models for domain-specific applications, such as medical image segmentation, remains an open question. Recent studies have released a medical version of the foundation model MedSAM by training on vast medical data, who promised SOTA medical segmentation. Independent community inspection and dissection is needed. Foundation models are developed for general purposes. On the other hand, stable delivery of reliable performance is key to clinical utility. This study aims at elucidating the potential advantage and limitations of landing the foundation models in clinical use by assessing the performance of off-the-shelf medical foundation model MedSAM for the segmentation of anatomical structures in pelvic MR images. We also explore the simple remedies by evaluating the dependency on prompting scheme. Finally, we demonstrate the need and performance gain of further specialized fine-tuning. MedSAM and its lightweight version LiteMedSAM were evaluated out-of-the-box on a public MR dataset consisting of 589 pelvic images split 80:20 for training and testing. An nnU-Net model was trained from scratch to serve as a benchmark and to provide bounding box prompts for MedSAM. MedSAM was evaluated using different quality bounding boxes, those derived from ground truth labels, those derived from nnU-Net, and those derived from the former two but with 5-pixel isometric expansion. Lastly, LiteMedSAM was refined on the training set and reevaluated on this task. Out-of-the-box MedSAM and LiteMedSAM both performed poorly across the structure set, especially for disjoint or non-convex structures. Varying prompt with different bounding box inputs had minimal effect. For example, the mean Dice score and mean Hausdorff distances (in mm) for obturator internus using MedSAM and LiteMedSAM were {0.251±0.110, 0.101±0.079} and {34.142±5.196, 33.688±5.306}, respectively. Fine-tuning of LiteMedSAM led to significant performance gain, improving Dice score and Hausdorff distance for the obturator internus to 0.864±0.123 and 5.022±10.684, on par with nnU-Net with no significant difference in evaluation of most structures. All segmentation structures benefited significantly from specialized refinement, at varying improvement margin. While our study alludes to the potential of deep learning models like MedSAM and LiteMedSAM for medical segmentation, it highlights the need for specialized refinement and adjudication. Off-the-shelf use of such large foundation models is highly likely to be suboptimal, and specialized fine-tuning is often necessary to achieve clinical desired accuracy and stability.

Read full abstract

Ground Truth Labels Research Articles

Related Topics

Articles published on Ground Truth Labels

An Unsupervised Anomaly Detection in Electricity Consumption Using Reinforcement Learning and Time Series Forest Based Framework

Automated Assessment of Sarcopenia with Hounsfield Unit Average Calculation in Computed Tomography Scans Using Deep Learning Techniques

Boosting grape bunch detection in RGB-D images using zero-shot annotation with Segment Anything and GroundingDINO

Quantification of Empty Lacunae in Tissue Sections of Osteonecrosis of the Femoral Head Using YOLOv8 Artificial Intelligence Model

Automatic Detection and Classification of Aurora in THEMIS All‐Sky Images

Self-Supervised Adversarial Training of Monocular Depth Estimation Against Physical-World Attacks.

Validation of Deep Learning-based Automatic Retinal Layer Segmentation Algorithms for AMD with Two SD-OCT Devices

Classification of Pneumonia, Tuberculosis and Covid-19 from Chest X-Ray Images Using Convolution Neural Network Model

Closing the gap in domain adaptation for semantic segmentation: a time-aware method

ScVAG: Unified single-cell clustering via variational-autoencoder integration with Graph Attention Autoencoder

Motion-Aware Self-Supervised RGBT Tracking with Multi-Modality Hierarchical Transformers

Performance of an AI-powered visualization software platform for precision surgery in breast cancer patients

Gaze Zone Classification for Driving Studies Using YOLOv8 Image Classification.

Abstract 4135928: Externally Validated Deep Learning Model for Patent Ductus Arteriosus Detection by Echocardiography in Preterm Infants

Development of a Dual-Plane MRI-Based Deep Learning Model to Assess the 1-Year Postoperative Outcomes in Lumbar Disc Herniation After Tubular Microdiscectomy.

Annotation-free multi-organ anomaly detection in abdominal CT using free-text radiology reports: A multi-centre retrospective study

Automated dentition segmentation: 3D UNet-based approach with MIScnn framework

Computational staining of CD3/CD20 positive lymphocytes in human tissues with experimental confirmation in a genetically engineered mouse model.

Necessity and impact of specialization of large foundation model for medical segmentation tasks.

Optimizing Football Formation Analysis via LSTM-Based Event Detection

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Ground Truth Labels Research Articles

Related Topics

Articles published on Ground Truth Labels

An Unsupervised Anomaly Detection in Electricity Consumption Using Reinforcement Learning and Time Series Forest Based Framework

Automated Assessment of Sarcopenia with Hounsfield Unit Average Calculation in Computed Tomography Scans Using Deep Learning Techniques

Boosting grape bunch detection in RGB-D images using zero-shot annotation with Segment Anything and GroundingDINO

Quantification of Empty Lacunae in Tissue Sections of Osteonecrosis of the Femoral Head Using YOLOv8 Artificial Intelligence Model

Automatic Detection and Classification of Aurora in THEMIS All‐Sky Images

Self-Supervised Adversarial Training of Monocular Depth Estimation Against Physical-World Attacks.

Validation of Deep Learning-based Automatic Retinal Layer Segmentation Algorithms for AMD with Two SD-OCT Devices

Classification of Pneumonia, Tuberculosis and Covid-19 from Chest X-Ray Images Using Convolution Neural Network Model

Closing the gap in domain adaptation for semantic segmentation: a time-aware method

ScVAG: Unified single-cell clustering via variational-autoencoder integration with Graph Attention Autoencoder

Motion-Aware Self-Supervised RGBT Tracking with Multi-Modality Hierarchical Transformers

Performance of an AI-powered visualization software platform for precision surgery in breast cancer patients

Gaze Zone Classification for Driving Studies Using YOLOv8 Image Classification.

Abstract 4135928: Externally Validated Deep Learning Model for Patent Ductus Arteriosus Detection by Echocardiography in Preterm Infants

Development of a Dual-Plane MRI-Based Deep Learning Model to Assess the 1-Year Postoperative Outcomes in Lumbar Disc Herniation After Tubular Microdiscectomy.

Annotation-free multi-organ anomaly detection in abdominal CT using free-text radiology reports: A multi-centre retrospective study

Automated dentition segmentation: 3D UNet-based approach with MIScnn framework

Computational staining of CD3/CD20 positive lymphocytes in human tissues with experimental confirmation in a genetically engineered mouse model.

Necessity and impact of specialization of large foundation model for medical segmentation tasks.

Optimizing Football Formation Analysis via LSTM-Based Event Detection