PurposeTo compare the performance of artificial intelligence (AI) in auto-contouring compared to a human practitioner in terms of precision, differences in dose distribution and time consumption. Methods and MaterialsDatasets of previously irradiated patients in three different segments (head and neck- (HNC), breast- (BC) and prostate cancer (PC)) were retrospectively collected. An experienced radiation oncologist (MD) performed organs-at-risk (OARs) and standard clinical target volume (CTV) delineations as baseline structures for comparison. AI-based auto-contours were generated in two additional CT copies; therefore, three groups were assessed: MD alone, AI alone, AI plus MD corrections (AI+C). Differences in Dice similarity coefficient (DSC) and person-hour burden were assessed. Furthermore, changes in clinically relevant dose-volume parameters were evaluated and compared. ResultsSeventy-five previously treated cases were collected (25 per segment) for the analysis. When compared to MD contours, the mean DSC scores were higher than 0.7 for 74% and 80% of AI and AI+C, respectively. After corrections, 17.1% structures presented DSC score deviations higher than 0.1 and 10.4% dose-volume parameters significantly changed in AI-contoured structures. The time consumption assessment yielded mean person-hour reductions of 68%, 51% and 71% for BC, PC and HNC, respectively. ConclusionIn great extent, AI yielded clinically acceptable OARs and certain CTVs in the explored anatomical segments. Sparse correction and assessment requirements place AI+C as a standard workflow. Minimal clinically relevant differences in OAR-exposure were identified. A substantial amount of person-hours could be repurposed with this technology.
Read full abstract