The purpose of this work is to develop a voxel-wise dose prediction model using a convolutional neural network (CNN) for cervical cancer high-dose-rate (HDR) intracavitary brachytherapy treatment planning with tandem-and-ovoid (T&O) or tandem-and-ring (T&R) applicators. A 3D U-Net CNN was implemented to generate voxel-wise dose predictions based on high-risk clinical target volume (HRCTV) and organ-at-risk (OAR) contour information. A multi-institutional cohort of 77 retrospective clinical HDR brachytherapy plans treated to a prescription dose in the range of 4.8-7.0 Gy/fx was used in this study. The plans were randomly divided 60%/20%/20% into training, validation, and testing cohorts. Data augmentation was applied to the training and validation cohorts to increase the number of plans to 252. The model was trained using the mean squared error loss function and the Adam optimization algorithm, with a learning rate of 0.001, 250 epochs, and a batch size of 8. Model performance was evaluated on the testing dataset by comparing the predicted dose distributions against the ground-truth dose distributions using dose statistics (mean dose values) and clinically meaningful dose-volume-histogram (DVH) indices derived from the 3D dose distributions. The proposed 3D U-Net model showed competitive accuracy in predicting 3D dose distributions that closely resemble the ground-truth dose distributions. The average value of mean absolute error was 0.108±3.617 Gy for HRCTV, 0.074±1.315 Gy for bladder, 0.093±0.981 Gy for rectum, and 0.035±2.789 Gy for sigmoid. The median absolute error was 1.264 Gy for HRCTV, 0.441 Gy for bladder, 0.018 Gy for rectum, and 0.019 Gy for sigmoid.
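The cohort split and the per-structure error statistics described above can be illustrated with a minimal NumPy sketch. It is not the authors' code: the function names `split_plans` and `absolute_error_stats`, the fixed random seed, the rounding of cohort sizes, and the synthetic arrays are all assumptions made for illustration, and the abstract does not state how the voxel errors were aggregated across patients.

```python
import numpy as np

def split_plans(n_plans, fractions=(0.6, 0.2, 0.2), seed=0):
    """Randomly assign plan indices to training, validation, and testing cohorts.

    Cohort sizes are rounded to the nearest integer (an assumption); the
    testing cohort receives whatever remains.
    """
    rng = np.random.default_rng(seed)
    order = rng.permutation(n_plans)
    n_train = int(round(fractions[0] * n_plans))
    n_val = int(round(fractions[1] * n_plans))
    return order[:n_train], order[n_train:n_train + n_val], order[n_train + n_val:]

def absolute_error_stats(pred, truth, mask):
    """Mean and median voxel-wise absolute dose error (Gy) inside one structure."""
    err = np.abs(pred[mask] - truth[mask])
    return float(err.mean()), float(np.median(err))

# 77 plans, as in the abstract; the exact cohort sizes below follow from rounding.
train_idx, val_idx, test_idx = split_plans(77)

# Tiny synthetic example: a 2 x 2 x 2 "dose grid" in Gy and a hypothetical mask.
truth = np.zeros((2, 2, 2))
pred = np.arange(8, dtype=float).reshape(2, 2, 2) * 0.1
mask = pred >= 0.4  # stands in for a contoured structure such as the HRCTV
mean_abs_err, median_abs_err = absolute_error_stats(pred, truth, mask)
```

In practice `mask` would be the boolean HRCTV or OAR contour resampled to the dose grid, and the statistics would be computed per test plan before averaging.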
Our results showed that the predicted mean D2cc OAR doses in the bladder, rectum, and sigmoid were 4.02±0.93, 3.77±0.57, and 3.36±0.82 Gy, compared with 4.35±0.98, 3.99±0.53, and 3.75±0.83 Gy in clinical plans, respectively; the mean individual differences (ΔD2cc) were 0.33±0.41, 0.22±0.16, and 0.39±0.45 Gy, respectively. The predicted D90 of the HRCTV was 6.72±0.99 Gy, compared with 6.83±1.72 Gy in clinical plans. These results indicate good potential for predicting useful dosimetric indices, which could improve the brachytherapy treatment planning workflow. The proposed model needs less than 5 seconds to predict a full 3D dose distribution of 64 × 64 × 64 voxels for a new patient plan, making it fast enough for near real-time applications and for aiding decision-making in the clinic. The predicted 3D dose distributions can be used before planning, for planning quality assurance, and to guide future automated planning for improved plan consistency, quality, and efficiency.
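The DVH indices quoted above, D90 and D2cc, can be derived from a 3D dose array and a structure mask. The following is a minimal NumPy sketch under stated assumptions: the helper names `d_percent` and `d_cc`, the 0.1 cm^3 voxel volume, and the synthetic dose grid are hypothetical, and clinical planning systems may interpolate the DVH differently.

```python
import numpy as np

def d_percent(dose, mask, percent):
    """Minimum dose (Gy) received by the hottest `percent` % of the structure.

    D90 is the dose exceeded by 90% of the voxels, i.e. the 10th percentile.
    """
    return float(np.percentile(dose[mask], 100.0 - percent))

def d_cc(dose, mask, volume_cc, voxel_volume_cc):
    """Minimum dose (Gy) received by the hottest `volume_cc` cm^3 of the structure."""
    voxels = np.sort(dose[mask])[::-1]               # hottest voxels first
    n = int(round(volume_cc / voxel_volume_cc))      # voxels making up the volume
    n = max(1, min(n, voxels.size))                  # guard against tiny structures
    return float(voxels[n - 1])

# Synthetic 10 x 10 x 10 dose grid in Gy (0.00 to 9.99), with every voxel
# treated as part of the structure; the 0.1 cm^3 voxel volume is assumed.
dose = np.arange(1000, dtype=float).reshape(10, 10, 10) / 100.0
mask = np.ones(dose.shape, dtype=bool)

d90 = d_percent(dose, mask, 90.0)
d2cc = d_cc(dose, mask, volume_cc=2.0, voxel_volume_cc=0.1)
```

Applying the same two functions to the predicted and the clinical dose arrays, and subtracting the results, gives a per-plan difference of the kind reported as ΔD2cc.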