PurposeTo develop machine learning (ML) and multivariable regression models to predict ipsilateral breast event (IBE) risk after ductal carcinoma in situ (DCIS) treatment. MethodsA retrospective investigation was conducted of patients diagnosed with DCIS from 2007 to 2014 who were followed for a minimum of five years after treatment. Data about each patient were extracted from the medical records. Two ML models (penalized logistic regression and random forest) and a multivariable logistic regression model were developed to evaluate recurrence-related variables. Results650 women (mean age 56 years, range 27–87 years) underwent treatment for DCIS and were followed for at least five years after treatment (mean 8.0 years). 5.5% (n = 36) experienced an IBE. With multivariable analysis, the variables associated with higher IBE risk were younger age (adjusted odds ratio [aOR] 0.96, p = 0.02), dense breasts at mammography (aOR 3.02, p = 0.02), and < 5 years of endocrine therapy (aOR 4.48, p = 0.02). The multivariable regression model to predict IBE risk achieved an area under the receiver operating characteristic curve (AUC) of 0.75 (95% CI 0.67–0.84). The penalized logistic regression and random forest models achieved mean AUCs of 0.52 (95% CI 0.42–0.61) and 0.54 (95% CI 0.43–0.65), respectively. ConclusionVariables associated with higher IBE risk after DCIS treatment include younger age, dense breasts, and <5 years of adjuvant endocrine therapy. The multivariable logistic regression model attained the highest AUC (0.75), suggesting that regression models have a critical role in risk prediction for patients with DCIS.
Read full abstract7-days of FREE Audio papers, translation & more with Prime
7-days of FREE Prime access