ObjectiveBreast cancer has become the most prevalent malignant tumor in women, and the occurrence of distant metastasis signifies a poor prognosis. Utilizing predictive models to forecast distant metastasis in breast cancer presents a novel approach. This study aims to utilize readily available clinical data and advanced machine learning algorithms to establish an accurate clinical prediction model. The overall objective is to provide effective decision support for clinicians.MethodsData from 239 patients from two centers were analyzed, focusing on clinical blood biomarkers (tumor markers, liver and kidney function, lipid profile, cardiovascular markers). Spearman correlation and the least absolute shrinkage and selection operator regression were employed for feature dimension reduction. A predictive model was built using LightGBM and validated in training, testing, and external validation cohorts. Feature importance correlation analysis was conducted on the clinical model and the comprehensive model, followed by univariate and multivariate regression analysis of these features.ResultsThrough internal and external validation, we constructed a LightGBM model to predict de novo bone metastasis in newly diagnosed breast cancer patients. The area under the receiver operating characteristic curve values of this model in the training, internal validation test, and external validation test1 cohorts were 0.945, 0.892, and 0.908, respectively. Our validation results indicate that the model exhibits high sensitivity, specificity, and accuracy, making it the most accurate model for predicting bone metastasis in breast cancer patients. Carcinoembryonic Antigen, creatine kinase, albumin-globulin ratio, Apolipoprotein B, and Cancer Antigen 153 (CA153) play crucial roles in the model’s predictions. Lipoprotein a, CA153, gamma-glutamyl transferase, α-Hydroxybutyrate dehydrogenase, alkaline phosphatase, and creatine kinase are positively correlated with breast cancer bone metastasis, while white blood cell ratio and total cholesterol are negatively correlated.ConclusionThis study successfully utilized clinical blood biomarkers to construct an artificial intelligence model for predicting distant metastasis in breast cancer, demonstrating high accuracy. This suggests potential clinical utility in predicting and identifying distant metastasis in breast cancer. These findings underscore the potential prospect of developing economically efficient and readily accessible predictive tools in clinical oncology.
Read full abstract