The time-consuming and user-dependent postprocessing of biochemical cartilage MRI has limited the use of delayed gadolinium-enhanced MRI of cartilage (dGEMRIC). An automated analysis of biochemical three-dimensional (3-D) images could deliver a more time-efficient and objective evaluation of cartilage composition, and provide comprehensive information about cartilage thickness, surface area, and volume compared with manual two-dimensional (2-D) analysis. (1) How does the 3-D analysis of cartilage thickness and dGEMRIC index using both a manual and a new automated method compare with the manual 2-D analysis (gold standard)? (2) How does the manual 3-D analysis of regional patterns of dGEMRIC index, cartilage thickness, surface area and volume compare with a new automatic method? (3) What is the interobserver reliability and intraobserver reproducibility of software-assisted manual 3-D and automated 3-D analysis of dGEMRIC indices, thickness, surface, and volume for two readers on two time points? In this IRB-approved, retrospective, diagnostic study, we identified the first 25 symptomatic hips (23 patients) who underwent a contrast-enhanced MRI at 3T including a 3-D dGEMRIC sequence for intraarticular pathology assessment due to structural hip deformities. Of the 23 patients, 10 (43%) were male, 16 (64%) hips had a cam deformity and 16 (64%) hips had either a pincer deformity or acetabular dysplasia. The development of an automated deep-learning-based approach for 3-D segmentation of hip cartilage models was based on two steps: First, one reader (FS) provided a manual 3-D segmentation of hip cartilage, which served as training data for the neural network and was used as input data for the manual 3-D analysis. Next, we developed the deep convolutional neural network to obtain an automated 3-D cartilage segmentation that we used as input data for the automated 3-D analysis. For actual analysis of the manually and automatically generated 3-D cartilage models, a dedicated software was developed. Manual 2-D analysis of dGEMRIC indices and cartilage thickness was performed at each "full-hour" position on radial images and served as the gold standard for comparison with the corresponding measurements of the manual and the automated 3-D analysis. We measured dGEMRIC index, cartilage thickness, surface area, and volume for each of the four joint quadrants and compared the manual and the automated 3-D analyses using mean differences. Agreement between the techniques was assessed using intraclass correlation coefficients (ICC). The overlap between 3-D cartilage volumes was assessed using dice coefficients and means of all distances between surface points of the models were calculated as average surface distance. The interobserver reliability and intraobserver reproducibility of the software-assisted manual 3-D and the automated 3-D analysis of dGEMRIC indices, thickness, surface and volume was assessed for two readers on two different time points using ICCs. Comparable mean overall difference and almost-perfect agreement in dGEMRIC indices was found between the manual 3-D analysis (8 ± 44 ms, p = 0.005; ICC = 0.980), the automated 3-D analysis (7 ± 43 ms, p = 0.015; ICC = 0.982), and the manual 2-D analysis.Agreement for measuring overall cartilage thickness was almost perfect for both 3-D methods (ICC = 0.855 and 0.881) versus the manual 2-D analysis. A mean difference of -0.2 ± 0.5 mm (p < 0.001) was observed for overall cartilage thickness between the automated 3-D analysis and the manual 2-D analysis; no such difference was observed between the manual 3-D and the manual 2-D analysis.Regional patterns were comparable for both 3-D methods. The highest dGEMRIC indices were found posterosuperiorly (manual: 602 ± 158 ms; p = 0.013, automated: 602 ± 158 ms; p = 0.012). The thickest cartilage was found anteroinferiorly (manual: 5.3 ± 0.8 mm, p < 0.001; automated: 4.3 ± 0.6 mm; p < 0.001). The smallest surface area was found anteroinferiorly (manual: 134 ± 60 mm; p < 0.001, automated: 155 ± 60 mm; p < 0.001). The largest volume was found anterosuperiorly (manual: 2343 ± 492 mm; p < 0.001, automated: 2294 ± 467 mm; p < 0.001). Mean average surface distance was 0.26 ± 0.13 mm and mean Dice coefficient was 86% ± 3%. Intraobserver reproducibility and interobserver reliability was near perfect for overall analysis of dGEMRIC indices, thickness, surface area, and volume (ICC range, 0.962-1). The presented deep learning approach for a fully automatic segmentation of hip cartilage enables an accurate, reliable and reproducible analysis of dGEMRIC indices, thickness, surface area, and volume. This time-efficient and objective analysis of biochemical cartilage composition and morphology yields the potential to improve patient selection in femoroacetabular impingement (FAI) surgery and to aid surgeons with planning of acetabuloplasty and periacetabular osteotomies in pincer FAI and hip dysplasia. In addition, this validation paves way to the large-scale use of this method for prospective trials which longitudinally monitor the effect of reconstructive hip surgery and the natural course of osteoarthritis. Level III, diagnostic study.
Read full abstract