Abstract Background Accurate prediction of mortality following an ischemic stroke is essential for tailoring personalized treatment strategies. This study evaluates the effectiveness of machine learning models in predicting one-year mortality after an ischemic stroke. Methods Five machine learning models were trained using data from a national stroke registry, with logistic regression demonstrating the highest performance. The SHapley Additive exPlanations (SHAP) analysis explained the model’s outcomes and defined the influential predictive factors. Results Analyzing 8183 ischemic stroke patients, logistic regression achieved 83% accuracy, 0.89 AUC, and an F1 score of 0.83. Significant predictors included stroke severity, pre-stroke functional status, age, hospital-acquired pneumonia, ischemic stroke subtype, tobacco use, and co-existing diabetes mellitus (DM). Discussion The model highlights the importance of predicting mortality in enhancing personalized stroke care. Apart from pneumonia, all predictors can serve the early prediction of mortality risk which supports the initiation of early preventive measures and in setting realistic expectations of disease outcomes for all stakeholders. The identified tobacco paradox warrants further investigation. Conclusion This study offers a promising tool for early prediction of stroke mortality and for advancing personalized stroke care. It emphasizes the need for prospective studies to validate these findings in diverse clinical settings.
Read full abstract