Accurate measurement of ureteral diameters plays a pivotal role in diagnosing and monitoring urinary tract obstruction (UTO). While three-dimensional magnetic resonance urography (3D MRU) represents a significant advancement in imaging, the traditional manual methods for assessing ureteral diameters are characterized by labor-intensive procedures and inherent variability. In the realm of medical image analysis, deep learning has led to a paradigm shift, yet the development of a comprehensive automated tool for the precise segmentation and measurement of ureters in MR images is an unaddressed challenge. The ureter was quantitatively measured on 3D MRU images using a deep learning model. A retrospective cohort of 445 3D MRU scans (443 patients, 52±18 years; 217 female patients) was collected and split into training, validation, and internal testing cohorts. A 3D V-Net model was trained for urinary tract segmentation, and a post-processing algorithm was developed for ureteral measurements. The accuracy of the segmentation was evaluated using the Dice similarity coefficient (DSC) and volume intraclass correlation coefficient (ICC), with ground truth segmentations provided by experienced radiologists. The external cohort comprised 50 scans (50 patients, 55±21 years; 30 female patients), and the model-predicted ureteral diameter measurements were compared with manual measurements to assess system performance. The various diameter parameters of ureter among the different measurement methods (ground truth, auto-segmentation with automatic diameter extraction, and manual segmentation with automatic diameter extraction) were assessed with Friedman tests and post hoc Dunn test. The effectiveness of the UTO diagnosis was assessed by receiver operating characteristic (ROC) curves and their respective areas under the curve (AUC) between different methods. In both the internal test and external cohorts, the mean DSC values for bilateral ureters exceeded 0.70. The ICCs for the bilateral ureter volume obtained by comparing the model and manual segmentation were all greater than 0.96 (p < 0.05), except for the right ureter in the internal test cohort, for which the ICC was 0.773 (p < 0.05). The mean DSCs for interobserver and intraobserver reliability were all above 0.97. The maximum diameter of the ureter exhibited no statistically significant differences either in the dilated (p=0.08) or in the non-dilated (p=0.32) ureters across the three measurement methods. The AUCs of ground truth, auto-segmentation with automatic diameter extraction, and manual segmentation with automatic diameter extraction in diagnosing UTO were 0.988 (95% CI: 0.934, 1.000), 0.961 (95% CI: 0.893, 0.991), and 0.979 (95% CI: 0.919, 0.998), respectively. There was no statistical difference between AUCs of the different methods (p>0.05). The proposed deep learning model and post-processing algorithm provide an effective means for the quantitative evaluation of urinary diseases using 3D MRU images.
Read full abstract