Evaluation of an Externally Trained Deep Learning-Based Auto-Segmentation Software in the Process of Artificial Intelligence-Assisted Radiation Treatment Planning for Thoracic Cancers

M Pan,L Serre,J Yousuf,K.J Hirmiz,C Michie,L Brown,J Agapito

doi:10.1016/j.ijrobp.2022.07.896

Abstract

<h3>Purpose/Objective(s)</h3> Contouring Organs-at-risk (OAR) is a laborious process that often delays radiation treatment plan design. A few FDA approved auto-segmentation software (AS) have become available. Our goal is to validate such a commercial AS in thoracic cancer OAR contouring. <h3>Materials/Methods</h3> We installed an externally trained AS into our AI computer. Validation is judged by our current gold standard contouring (GSC) by two experienced planners and one radiation oncologist (RO). We used 30 lung or esophageal cancer planning datasets to generate GSC and AI contours (AIC). Objective analysis included Dice Similarity Coefficient (DSC) and 95% Hausdorff distance (95% HD). Subjective analysis was done by two ROs to score 1 to 3 on all OARs by GSC and AIC that were randomly blended and anonymized with consistent nomenclature (1: no modification required; 2: minor modification required but adequate for clinical use; 3: major modification required and not suitable for clinical use). <h3>Results</h3> Most retrospective peer-reviewed OAR contours neglected some less important structures on CT slices typically far away from the target of the 30 patients, median age 75 years (54-90), including 22 males and 8 females, with 28 average pixel density data-sets from 4D-CT for lung cancer and 2 fast helical scans for esophageal cancer. We had to re-contour most of the OARs to generate GSC. The median GSC and AIC contouring times were 60 vs 2.5 minutes for up to 12 OARs, some of which were only partially available in the datasets (e.g., stomach and liver). Due to the inconsistency of contouring organs far away from the planning target volume, we only chose six main OARs for initial validation and analysis. Comparing AICs to GSCs, the mean DSC and 95% HD were: esophagus 0.61 and 16 mm, heart 0.85 and 13.1 mm, left lung 0.97 and 5.9 mm, right lung 0.96 and 5.7 mm, spinal cord 0.82 and 10.7 mm, trachea and proximal bronchial tree (TPB) 0.67 and 19.1 mm, respectively. The two ROs agreed with 100% of four OARs on GSC, i.e., both RO scoring 1 or 2 meaning adequate for planning purpose, with the exception of esophagus having 96.7% vs 100% and right lung having 100% vs 96.7% agreement, respectively. They had less agreement on AIC, with esophagus 90% vs 60%, heart 83.3% vs 86.7%, left lung 100% vs 96.7%, right lung 100% vs 96.7%, spinal cord 100% vs 100%, TPB 96.7% vs 86.7% agreement, respectively. The inter-observer variabilities are significantly larger when ROs evaluated esophagus and heart AIC (p=0.046 and 0.05, respectively, Student's t-test). <h3>Conclusion</h3> The accuracy of an externally trained deep learning-based AS might not be acceptable without in-house training from local protocols. Retrospective peer-reviewed OAR contours might not be good enough in the training and evaluation of AS. Our future work involves training AS using our GSCs and re-evaluating its performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Evaluation of an Externally Trained Deep Learning-Based Auto-Segmentation Software in the Process of Artificial Intelligence-Assisted Radiation Treatment Planning for Thoracic Cancers

Abstract

Talk to us

Similar Papers

More From: International Journal of Radiation OncologyBiologyPhysics

Lead the way for us

Similar Papers

Impact of Artificial Intelligence-Based Autosegmentation of Organs at Risk in Low- and Middle-Income Countries
Solomon Kibudde ... Yao Hao
Advances in Radiation Oncology | VOL. 9
Solomon Kibudde, et. al.Solomon Kibudde ... Yao Hao
01 Nov 2024
Advances in Radiation Oncology | VOL. 9

A Prospective Observational Study of Clinical Acceptability of Deep Learning Model for the Automated Segmentation of Organs at Risk for Head and Neck Radiotherapy Treatment Planning
...
International Journal of Radiation Oncology*Biology*Physics | VOL. 114
, et. al. ...
22 Oct 2022
International Journal of Radiation Oncology*Biology*Physics | VOL. 114

Evaluation of Deep Learning-Based Auto-Segmentation of Target Volume and Organs-at-Risk in Breast Cancer Patients
S.Y Chung ... Y.B Kim
International Journal of Radiation Oncology*Biology*Physics | VOL. 108
S.Y Chung, et. al.S.Y Chung ... Y.B Kim
23 Oct 2020
International Journal of Radiation Oncology*Biology*Physics | VOL. 108

Clinical Validation and Treatment Plan Evaluation Based on Autodelineation of the Clinical Target Volume for Prostate Cancer Radiotherapy.
Jing Shen ... Yu Chen
Technology in Cancer Research & Treatment | VOL. 22
Jing Shen, et. al.Jing Shen ... Yu Chen
01 Jan 2023
Technology in Cancer Research & Treatment | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Evaluation of an Externally Trained Deep Learning-Based Auto-Segmentation Software in the Process of Artificial Intelligence-Assisted Radiation Treatment Planning for Thoracic Cancers

Abstract

Talk to us

Similar Papers

More From: International Journal of Radiation Oncology*Biology*Physics

More From: International Journal of Radiation OncologyBiologyPhysics