Purpose90Y SPECT-based dosimetry following radioembolization (RE) in liver malignancies is challenging due to the inherent scatter and the poor spatial resolution of bremsstrahlung SPECT. This study explores a deep-learning-based absorbed dose-rate estimation method for 90Y that mitigates the impact of poor SPECT image quality on dosimetry and the accuracy–efficiency trade-off of Monte Carlo (MC)-based scatter estimation and voxel dosimetry methods.MethodsOur unified framework consists of three stages: convolutional neural network (CNN)-based bremsstrahlung scatter estimation, SPECT reconstruction with scatter correction (SC) and absorbed dose-rate map generation with a residual learning network (DblurDoseNet). The input to the framework is the measured SPECT projections and CT, and the output is the absorbed dose-rate map. For training and testing under realistic conditions, we generated a series of virtual patient phantom activity/density maps from post-therapy images of patients treated with 90Y-RE at our clinic. To train the scatter estimation network, we use the scatter projections for phantoms generated from MC simulation as the ground truth (GT). To train the dosimetry network, we use MC dose-rate maps generated directly from the activity/density maps of phantoms as the GT (Phantom + MC Dose). We compared performance of our framework (SPECT w/CNN SC + DblurDoseNet) and MC dosimetry (SPECT w/CNN SC + MC Dose) using normalized root mean square error (NRMSE) and normalized mean absolute error (NMAE) relative to GT.ResultsWhen testing on virtual patient phantoms, our CNN predicted scatter projections had NRMSE of 4.0% ± 0.7% on average. For the SPECT reconstruction with CNN SC, we observed a significant improvement on NRMSE (9.2% ± 1.7%), compared to reconstructions with no SC (149.5% ± 31.2%). In terms of virtual patient dose-rate estimation, SPECT w/CNN SC + DblurDoseNet had a NMAE of 8.6% ± 5.7% and 5.4% ± 4.8% in lesions and healthy livers, respectively; compared to 24.0% ± 6.1% and 17.7% ± 2.1% for SPECT w/CNN SC + MC Dose. In patient dose-rate maps, though no GT was available, we observed sharper lesion boundaries and increased lesion-to-background ratios with our framework. For a typical patient data set, the trained networks took ~ 1 s to generate the scatter estimate and ~ 20 s to generate the dose-rate map (matrix size: 512 × 512 × 194) on a single GPU (NVIDIA V100).ConclusionOur deep learning framework, trained using true activity/density maps, has the potential to outperform non-learning voxel dosimetry methods such as MC that are dependent on SPECT image quality. Across comprehensive testing and evaluations on multiple targeted lesions and healthy livers in virtual patients, our proposed deep learning framework demonstrated higher (66% on average in terms of NMAE) estimation accuracy than the current “gold-standard” MC method. The enhanced computing speed with our framework without sacrificing accuracy is highly relevant for clinical dosimetry following 90Y-RE.