Objective: We aimed to compare the clinical utility and diagnostic accuracy of the ADNEX model, ROMA score, RMI I, and RMI IV, as well as two serum markers (CA125 and HE4) in preoperative discrimination between benign and malignant adnexal masses (AMs). Methods: We conducted a retrospective study extracting all consecutive patients with AMs seen at our Institution between January 2015 and December 2020. Accuracy metrics included sensitivity (SE), specificity (SP), and area under the receiver operating characteristic curve (AUC), and their 95% confidence intervals (CI) were calculated for basic discrimination between AMs. Model performance was evaluated in terms of discrimination ability and clinical utility (net benefit, NB). Results: A total of 581 women were included; 481 (82.8%) had a benign ovarian tumor and 100 (17.2%) had a malignant tumor. The SE and SP of CA125, HE4, ROMA score, RMI I, RMI IV, and ADNEX model were 0.60 (0.54–0.66) and 0.80 (0.76–0.83); 0.39 (0.30–0.49) and 0.96 (0.94–0.98); 0.59 (0.50–0.68) and 0.92 (0.88–0.95); 0.56 (0.46–0.65) and 0.98 (0.96–0.99); 0.54 (0.44–0.63) and 0.96 (0.94–0.98); 0.82 (0.73–0.88) and 0.91 (0.89–0.94), respectively. The overall AUC was 0.76 (0.74–0.79) for CA125, 0.81 (0.78–0.83) for HE4, 0.82 (0.80–0.85) for ROMA, 0.86 (0.84–0.88) for RMI I, 0.83 (0.81–0.86) for RMI IV, and 0.92 (0.90–0.94) for ADNEX. The NB for ADNEX was higher than other biomarkers and models across all decision thresholds between 5% and 50%. Conclusions: The ADNEX model showed a better discrimination ability and clinical utility when differentiating malignant from benign Ams, compared to CA125, HE4, ROMA score, RMI I, and RMI IV.
Read full abstract