Abstract
Cardiovascular diseases (CVDs) are the leading cause of death worldwide, representing a significant public health challenge. Myocardial Infarction (MI), a severe manifestation of CVDs, contributes substantially to these fatalities. Machine learning holds great promise for predicting MI. This study explores the potential of Gradient Boosting (GB) techniques for this purpose, explicitly focusing on CatBoost, LightGBM, XGBoost, and XGBoost Random Forest. The study leverages GB's embedded feature selection, missing-value handling, and hyperparameter tuning capabilities. Performance was evaluated using multiple metrics: Area Under the Curve (AUC), classification accuracy, F1 score, precision, recall, and Matthews Correlation Coefficient (MCC). A probabilistic comparison matrix was used to assess the relative performance of the GB models. The results demonstrate the superiority of CatBoost, achieving a classification accuracy of 94.9%, an AUC of 0.992, a recall of 94.9%, and an MCC of 0.82. The probabilistic comparison further confirms CatBoost's superior performance. These findings contribute to MI prediction, highlighting the predictive potential of the CatBoost algorithm and ultimately aiding the fight against MI to achieve better patient outcomes.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have