Abstract
Pathological axillary lymph node (pALN) burden is an important factor for treatment decision-making in clinical T1-T2 (cT1-T2) stage breast cancer. Preoperative assessment of the pALN burden and prognosis aids in the individualized selection of therapeutic approaches. To develop and validate a machine learning (ML) model based on clinicopathological and MRI characteristics for assessing pALN burden and survival in patients with cT1-T2 stage breast cancer. Retrospective. A total of 506 females (range: 24-83 years) with cT1-T2 stage breast cancer from two institutions, forming the training (N = 340), internal validation (N = 85), and external validation cohorts (N = 81), respectively. This study used 1.5-T, axial fat-suppressed T2-weighted turbo spin-echo sequence and axial three-dimensional dynamic contrast-enhanced fat-suppressed T1-weighted gradient echo sequence. Four ML methods (eXtreme Gradient Boosting [XGBoost], Support Vector Machine, k-Nearest Neighbor, Classification and Regression Tree) were employed to develop models based on clinicopathological and MRI characteristics. The performance of these models was evaluated by their discriminative ability. The best-performing model was further analyzed to establish interpretability and used to calculate the pALN score. The relationships between the pALN score and disease-free survival (DFS) were examined. Chi-squared test, Fisher's exact test, univariable logistic regression, area under the curve (AUC), Delong test, net reclassification improvement, integrated discrimination improvement, Hosmer-Lemeshow test, log-rank, Cox regression analyses, and intraclass correlation coefficient were performed. A P-value <0.05 was considered statistically significant. The XGB II model, developed based on the XGBoost algorithm, outperformed the other models with AUCs of 0.805, 0.803, and 0.818 in the three cohorts. The Shapley additive explanation plot indicated that the top variable in the XGB II model was the Node Reporting and Data System score. In multivariable Cox regression analysis, the pALN score was significantly associated with DFS (hazard ratio: 4.013, 95% confidence interval: 1.059-15.207). The XGB II model may allow to evaluate pALN burden and could provide prognostic information in cT1-T2 stage breast cancer patients. 3 TECHNICAL EFFICACY: Stage 2.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.