BackgroundMachine learning (ML) models have the potential to accurately predict outcomes and offer novel insights into inter-variable correlations. In this study, we aimed to design ML models for the prediction of 1-year mortality after percutaneous coronary intervention (PCI) in patients with acute coronary syndrome. MethodsThis study was performed on 13,682 patients at Tehran Heart Center from 2015 to 2021. Patients were split into 70:30 for testing and training. Four ML models were designed: a traditional Logistic Regression (LR) model, Random Forest (RF), Extreme Gradient Boosting (XGBoost), and Ada Boost models. The importance of features was calculated using the RF feature selector and SHAP based on the XGBoost model. The Area Under the Receiver Operating Characteristic Curve (AUC-ROC) for the prediction on the testing dataset was the main measure of the model's performance. ResultsFrom a total of 9,073 patients with >1-year follow-up, 340 participants died. Higher age and higher rates of comorbidities were observed in these patients. Body mass index and lipid profile demonstrated a U-shaped correlation with the outcome. Among the models, RF had the best discrimination (AUC 0.866), while the highest sensitivity (80.9%) and specificity (88.3%) were for LR and XGBoost models, respectively. All models had AUCs of >0.8. ConclusionML models can predict 1-year mortality after PCI with high performance. A classic LR statistical approach showed comparable results with other ML models. The individual-level assessment of inter-variable correlations provided new insights into the non-linear contribution of risk factors to post-PCI mortality.
Read full abstract