PurposeThe purpose of this study was to assess the inter- and intra-rater reliability of 21 anatomical landmarks initially placed with an AI-algorithm and then manually verified with human input. Methods30 knee CT scans of participants from the Multicenter Osteoarthritis Study (MOST) ages 45-55 years were included. Approximately one-half experienced progression of patellofemoral osteoarthritis, defined as an increased cartilage score in the patellofemoral compartment on MRI over 2 years. The algorithm automatically placed 19 anatomic landmarks on the femur, tibia, and patella. An additional 2 landmarks were added manually. Two landmark reviewers separately reviewed all 30 scans and verified all landmarks. After 2 weeks, one reviewer repeated the process for the same dataset. The mean Euclidean distance of manual landmark displacement, mean absolute disagreement between and within raters, and intraclass correlation coefficients for inter- and intra-rater reliability were calculated. ResultsAll landmarks had excellent inter-rater reliability. The tibial and femoral shaft centers had ICCs of 1, indicating their positions did not differ. Seventeen landmarks had ICCs between 0.90-0.99 and the tibial tuberosity had an ICC of 0.87. Intra-rater reliability scores were 1 for 16 landmarks and between 0.90-0.99 for the remaining five. ConclusionThere was excellent agreement on the locations of all 21 landmarks evaluated in this study. Clinical RelevanceThe potential role of artificial intelligence in medical imaging and orthopedic research is a growing area of interest. The excellent reliability demonstrated across multiple landmarks in our study reveals the potential for semi-automated 3D methods to enhance precision of anatomical measurements of the knee over 2D methods.
Read full abstract