Abstract

This study compared machine learning (ML) models with a logistic regression (LR) model to identify the optimal factors associated with mammography-occult (i.e. false-negative on mammography), magnetic resonance imaging (MRI)-detected, newly diagnosed breast cancer (BC). This single-centre retrospective study included consecutive women with BC who underwent both mammography and MRI (no more than 45 days apart) between January 2018 and May 2023. Several ML algorithms and binary logistic regression analysis were used to select features linked to mammography-occult BC, and these features were then used to build different models. The predictive value of the models was assessed using receiver operating characteristic (ROC) curve analysis. The study included 1957 malignant lesions from 1914 patients (mean age 51.64 ± 9.92 years; range 20-86 years). Of these lesions, 485 were mammography-occult BCs. The optimal features of mammography-occult BC were calcification status, tumour size, mammographic density, age, lesion enhancement type on MRI, and histological type. Among the ML models (artificial neural network [ANN], L1-regularised logistic regression [L1-LR], random forest [RF], and support vector machine [SVM]) and the LR-based combined model, the ANN model built on RF-selected features was optimal. It showed the best discriminative performance in predicting false-negative mammographic findings, with an area under the ROC curve (AUC) of 0.912, an accuracy of 86.90%, a sensitivity of 85.85%, and a specificity of 84.18%. Mammography-occult, MRI-detected breast cancers have characteristic features that should be considered when performing breast MRI, in order to improve the detection rate for breast cancer and to aid clinical management.
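For readers unfamiliar with the reported metrics, the following is a minimal sketch of how AUC, accuracy, sensitivity, and specificity can be computed from a model's predicted probabilities. It is not the authors' pipeline: the function names, the 0.5 decision threshold, and the toy labels and scores are all assumptions made for illustration, with label 1 standing for a mammography-occult lesion.

```python
from typing import Dict, Sequence


def roc_auc(y_true: Sequence[int], y_score: Sequence[float]) -> float:
    """AUC as the probability that a random positive outscores a random negative.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(y_true, y_score) if y == 1]
    neg = [s for y, s in zip(y_true, y_score) if y == 0]
    if not pos or not neg:
        raise ValueError("Both classes must be present to compute AUC")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def threshold_metrics(
    y_true: Sequence[int], y_score: Sequence[float], threshold: float = 0.5
) -> Dict[str, float]:
    """Accuracy, sensitivity, and specificity at one decision threshold."""
    tp = fp = tn = fn = 0
    for y, s in zip(y_true, y_score):
        predicted = 1 if s >= threshold else 0
        if y == 1 and predicted == 1:
            tp += 1
        elif y == 1 and predicted == 0:
            fn += 1
        elif y == 0 and predicted == 1:
            fp += 1
        else:
            tn += 1
    return {
        "accuracy": (tp + tn) / (tp + tn + fp + fn),
        "sensitivity": tp / (tp + fn),  # true-positive rate
        "specificity": tn / (tn + fp),  # true-negative rate
    }


# Hypothetical toy data, not taken from the study.
toy_labels = [1, 1, 1, 0, 0, 0, 0, 1]
toy_scores = [0.9, 0.8, 0.4, 0.3, 0.6, 0.2, 0.1, 0.7]

auc = roc_auc(toy_labels, toy_scores)
metrics = threshold_metrics(toy_labels, toy_scores, threshold=0.5)
print(f"AUC={auc:.4f}", metrics)
```

AUC summarises ranking quality across all thresholds, whereas accuracy, sensitivity, and specificity depend on the single threshold chosen, so the two kinds of figure answer different questions about a model.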
