Abstract

273 Background: To date, studies of machine learning (ML) algorithms within oncology for mortality prediction have focused on structured electronic health record (EHR) data. Given the complex symptom burden of patients with advanced cancers, ML models may be better suited to identify patterns and interactions between symptom burden and outcomes compared to traditional statistical methods. To that end, in this study, we leverage the patient reported outcomes (PRO) data together with clinical EHR-based variables to assess the performance of ML algorithms to predict mortality in patients with advanced cancers. Methods: We randomly selected 689 patients with advanced cancer who had their first Palliative Care encounter between January 2012 and December 2017. 59 patients were lost to follow-up and were excluded from this analysis. The remaining cohort of 630 patients was split 4:1 randomly into a training and validation set to develop and test a supervised ML algorithm (Extreme Gradient Boosting [XGB] tree) to predict the 6-month mortality. Candidate variables for algorithm development included gender, age, ECOG performance status (PS), number of prior systemic therapies, and scores on the Edmonton Symptom Assessment System (ESAS)-FS, a 12-item PRO measure of physical and psychosocial symptom burden include the composite Physical Symptom Score (PHS), a sum of the physical ESAS symptoms (pain, fatigue, nausea, drowsiness, shortness of breath, appetite, wellbeing, sleep). Results: Overall, 630 patients were included in this 6-month mortality prediction; mean age 59 years, 354 (56%) female; 276 (44%) male. Variables with the most significant impact on the XGB tree mortality prediction were the ESAS symptoms of shortness of breath (1-AUC, 0.295), appetite, ESAS PHS, financial distress, age, and appetite as well as ECOG PS and number of prior systemic therapies. The XGB tree algorithm demonstrated the best overall prediction performance of 6-month mortality in the independent testing set, AUC 0.716 (95% CI 0.63 - 0.81), sensitivity 0.75 (95% CI 0.66 - 0.87), and a positive predictive value 0.67 (95% CI 0.57 - 0.79). Conclusions: Our ML model leveraged PRO-based assessment of symptom burden to correctly identify the majority of patients who died within 6 months. These models are uniquely positioned to not only automatically identify patients at high risk for short-term mortality but also the specific symptoms of concern for clinical intervention. Such models can be applied to available clinical and PRO data to facilitate clinical decision-making. Futures studies on improving model performance with the inclusion of interventions to modify symptom burden are in design.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call