Standard U-Net Research Articles

Task automation is essential for efficient and consistent image segmentation in radiation oncology. We report on a deep learning architecture, comprising a U-Net and a variational autoencoder (VAE) for automatic contouring of the prostate gland incorporating interobserver variation for radiotherapy treatment planning. The U-Net/VAE generates an ensemble set of segmentations for each image CT slice. A novel outlier mitigation (OM) technique was implemented to enhance the model segmentation accuracy. The primary source dataset (source_prim) consisted of 19 200 CT slices (from 300 patient planning CT image datasets) with manually contoured prostate glands. A smaller secondary source dataset (source_sec) comprised 640 CT slices (from 10 patient CT datasets), where prostate glands were segmented by 5 independent physicians on each dataset to account for interobserver variability. Data augmentation via random rotation (<5 degrees), cropping, and horizontal flipping was applied to each dataset to increase sample size by a factor of 100. A probabilistic hierarchical U-Net with VAE was implemented and pretrained using the augmented source_prim dataset for 30 epochs. Model parameters of the U-Net/VAE were fine-tuned using the augmented source_sec dataset for 100 epochs. After the first round of training, outlier contours in the training dataset were automatically detected and replaced by the most accurate contours (based on Dice similarity coefficient, DSC) generated by the model. The U-Net/OM-VAE was retrained using the revised training dataset. Metrics for comparison included DSC, Hausdorff distance (HD, mm), normalized cross-correlation (NCC) coefficient, and center-of-mass (COM) distance (mm). Results for U-Net/OM-VAE with outliers replaced in the training dataset versus U-Net/VAE without OM were as follows: DSC=0.82±0.01 versus 0.80±0.02 (p=0.019), HD=9.18±1.22 versus 10.18±1.35mm (p=0.043), NCC=0.59±0.07 versus 0.62±0.06, and COM=3.36±0.81 versus 4.77±0.96mm over the average of 15 contours. For the average of 15 highest accuracy contours, values were as follows: DSC=0.90±0.02 versus 0.85±0.02, HD=5.47±0.02 versus 7.54±1.36mm, and COM=1.03±0.58 versus 1.46±0.68mm (p<0.03 for all metrics). Results for the U-Net/OM-VAE with outliers removed were as follows: DSC=0.78±0.01, HD=10.65±1.95mm, NCC=0.46±0.10, COM=4.17±0.79mm for the average of 15 contours, and DSC=0.88±0.02, HD=7.00±1.17mm, COM=1.58±0.63mm for the average of 15 highest accuracy contours. All metrics for U-Net/VAE trained on the source_prim and source_sec datasets via pretraining, followed by fine-tuning, show statistically significant improvement over that trained on the source_sec dataset only. Finally, all metrics for U-Net/VAE with or without OM showed statistically significant improvement over those for the standard U-Net. A VAE combined with a hierarchical U-Net and an OM strategy (U-Net/OM-VAE) demonstrates promise toward capturing interobserver variability and produces accurate prostate auto-contours for radiotherapy planning. The availability of multiple contours for each CT slice enables clinicians to determine trade-offs in selecting the "best fitting" contour on each CT slice. Mitigation of outlier contours in the training dataset improves prediction accuracy, but one must be wary of reduction in variability in the training dataset.

Read full abstract

PurposeDeep learning–based knowledge‐based planning (KBP) methods have been introduced for radiotherapy dose distribution prediction to reduce the planning time and maintain consistent high‐quality plans. This paper presents a novel KBP model using an attention‐gating mechanism and a three‐dimensional (3D) U‐Net for intensity‐modulated radiation therapy (IMRT) 3D dose distribution prediction in head‐and‐neck cancer.MethodsA total of 340 head‐and‐neck cancer plans, representing the OpenKBP—2020 AAPM Grand Challenge data set, were used in this study. All patients were treated with the IMRT technique and a dose prescription of 70 Gy. The data set was randomly divided into 64%/16%/20% as training/validation/testing cohorts. An attention‐gated 3D U‐Net architecture model was developed to predict full 3D dose distribution. The developed model was trained using the mean‐squared error loss function, Adam optimization algorithm, a learning rate of 0.001, 120 epochs, and batch size of 4. In addition, a baseline U‐Net model was also similarly trained for comparison. The model performance was evaluated on the testing data set by comparing the generated dose distributions against the ground‐truth dose distributions using dose statistics and clinical dosimetric indices. Its performance was also compared to the baseline model and the reported results of other deep learning‐based dose prediction models.ResultsThe proposed attention‐gated 3D U‐Net model showed high capability in accurately predicting 3D dose distributions that closely replicated the ground‐truth dose distributions of 68 plans in the test set. The average value of the mean absolute dose error was 2.972 ± 1.220 Gy (vs. 2.920 ± 1.476 Gy for a baseline U‐Net) in the brainstem, 4.243 ± 1.791 Gy (vs. 4.530 ± 2.295 Gy for a baseline U‐Net) in the left parotid, 4.622 ± 1.975 Gy (vs. 4.223 ± 1.816 Gy for a baseline U‐Net) in the right parotid, 3.346 ± 1.198 Gy (vs. 2.958 ± 0.888 Gy for a baseline U‐Net) in the spinal cord, 6.582 ± 3.748 Gy (vs. 5.114 ± 2.098 Gy for a baseline U‐Net) in the esophagus, 4.756 ± 1.560 Gy (vs. 4.992 ± 2.030 Gy for a baseline U‐Net) in the mandible, 4.501 ± 1.784 Gy (vs. 4.925 ± 2.347 Gy for a baseline U‐Net) in the larynx, 2.494 ± 0.953 Gy (vs. 2.648 ± 1.247 Gy for a baseline U‐Net) in the PTV_70, and 2.432 ± 2.272 Gy (vs. 2.811 ± 2.896 Gy for a baseline U‐Net) in the body contour. The average difference in predicting the D 99 value for the targets (PTV_70, PTV_63, and PTV_56) was 2.50 ± 1.77 Gy. For the organs at risk, the average difference in predicting the Dmax (brainstem, spinal cord, and mandible) and Dmean (left parotid, right parotid, esophagus, and larynx) values was 1.43 ± 1.01 and 2.44 ± 1.73 Gy, respectively. The average value of the homogeneity index was 7.99 ± 1.45 for the predicted plans versus 5.74 ± 2.95 for the ground‐truth plans, whereas the average value of the conformity index was 0.63 ± 0.17 for the predicted plans versus 0.89 ± 0.19 for the ground‐truth plans. The proposed model needs less than 5 s to predict a full 3D dose distribution of 64 × 64 × 64 voxels for a new patient that is sufficient for real‐time applications.ConclusionsThe attention‐gated 3D U‐Net model demonstrated a capability in predicting accurate 3D dose distributions for head‐and‐neck IMRT plans with consistent quality. The prediction performance of the proposed model was overall superior to a baseline standard U‐Net model, and it was also competitive to the performance of the best state‐of‐the‐art dose prediction method reported in the literature. The proposed model could be used to obtain dose distributions for decision‐making before planning, quality assurance of planning, and guiding‐automated planning for improved plan consistency, quality, and planning efficiency.

Read full abstract

Standard U-Net Research Articles

Related Topics

Articles published on Standard U-Net

A novel approach for semantic segmentation of automatic road network extractions from remote sensing images by modified UNet

STAMP: Simultaneous Training and Model Pruning for low data regimes in medical image segmentation.

An uncertainty-aware deep learning architecture with outlier mitigation for prostate gland segmentation in radiotherapy treatment planning.

Using high-resolution imagery and deep learning to classify land-use following deforestation: a case study in Ethiopia

Weighted average ensemble-based semantic segmentation in biological electron microscopy images

ConvUNeXt: An efficient convolution neural network for medical image segmentation

CEL-Unet: a novel CNN architecture for 3D Segmentation of Knee Bones affected by Severe Osteoarthritis for PSI-Based Surgical Planning.

Retinal Vessel Segmentation Based on the Anam-Net Model

Attention-aware 3D U-Net convolutional neural network for knowledge-based planning 3D dose distribution prediction of head-and-neck cancer.

Automatic coastline extraction through enhanced sea-land segmentation by modifying Standard U-Net

Joint Dense Residual and Recurrent Attention Network for DCE-MRI Breast Tumor Segmentation.

State-of-the-art retinal vessel segmentation with minimalistic models

Feasibility of the soft attention-based models for automatic segmentation of OCT kidney images.

Automatic Left Ventricle Segmentation from Short-Axis Cardiac MRI Images Based on Fully Convolutional Neural Network.

A Modular System Based on U-Net for Automatic Building Extraction from very high-resolution satellite images

Multi-Scale ConvLSTM Attention-Based Brain Tumor Segmentation

TransNorm: Transformer Provides a Strong Spatial Normalization Mechanism for a Deep Segmentation Model

An Improved Kidney Tumor Prediction Using Deep Convolutional Neural Network-Restricted Boltzmann Machine Technique in Medical Image Segmentation

Graph neural networks for laminar flow prediction around random two-dimensional shapes

Randomly connected neural networks for self-supervised monocular depth estimation

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Standard U-Net Research Articles

Related Topics

Articles published on Standard U-Net

A novel approach for semantic segmentation of automatic road network extractions from remote sensing images by modified UNet

STAMP: Simultaneous Training and Model Pruning for low data regimes in medical image segmentation.

An uncertainty-aware deep learning architecture with outlier mitigation for prostate gland segmentation in radiotherapy treatment planning.

Using high-resolution imagery and deep learning to classify land-use following deforestation: a case study in Ethiopia

Weighted average ensemble-based semantic segmentation in biological electron microscopy images

ConvUNeXt: An efficient convolution neural network for medical image segmentation

CEL-Unet: a novel CNN architecture for 3D Segmentation of Knee Bones affected by Severe Osteoarthritis for PSI-Based Surgical Planning.

Retinal Vessel Segmentation Based on the Anam-Net Model

Attention-aware 3D U-Net convolutional neural network for knowledge-based planning 3D dose distribution prediction of head-and-neck cancer.

Automatic coastline extraction through enhanced sea-land segmentation by modifying Standard U-Net

Joint Dense Residual and Recurrent Attention Network for DCE-MRI Breast Tumor Segmentation.

State-of-the-art retinal vessel segmentation with minimalistic models

Feasibility of the soft attention-based models for automatic segmentation of OCT kidney images.

Automatic Left Ventricle Segmentation from Short-Axis Cardiac MRI Images Based on Fully Convolutional Neural Network.

A Modular System Based on U-Net for Automatic Building Extraction from very high-resolution satellite images

Multi-Scale ConvLSTM Attention-Based Brain Tumor Segmentation

TransNorm: Transformer Provides a Strong Spatial Normalization Mechanism for a Deep Segmentation Model

An Improved Kidney Tumor Prediction Using Deep Convolutional Neural Network-Restricted Boltzmann Machine Technique in Medical Image Segmentation

Graph neural networks for laminar flow prediction around random two-dimensional shapes

Randomly connected neural networks for self-supervised monocular depth estimation