Strategic Machine Learning Optimization for Cardiovascular Disease Prediction and High-Risk Patient Identification

Konstantina-Vasiliki Tompra,George Papageorgiou,Christos Tjortjis

doi:10.3390/a17050178

Abstract

Despite medical advancements in recent years, cardiovascular diseases (CVDs) remain a major factor in rising mortality rates, challenging predictions despite extensive expertise. The healthcare sector is poised to benefit significantly from harnessing massive data and the insights we can derive from it, underscoring the importance of integrating machine learning (ML) to improve CVD prevention strategies. In this study, we addressed the major issue of class imbalance in the Behavioral Risk Factor Surveillance System (BRFSS) 2021 heart disease dataset, including personal lifestyle factors, by exploring several resampling techniques, such as the Synthetic Minority Oversampling Technique (SMOTE), Adaptive Synthetic Sampling (ADASYN), SMOTE-Tomek, and SMOTE-Edited Nearest Neighbor (SMOTE-ENN). Subsequently, we trained, tested, and evaluated multiple classifiers, including logistic regression (LR), decision trees (DTs), random forest (RF), gradient boosting (GB), XGBoost (XGB), CatBoost, and artificial neural networks (ANNs), comparing their performance with a primary focus on maximizing sensitivity for CVD risk prediction. Based on our findings, the hybrid resampling techniques outperformed the alternative sampling techniques, and our proposed implementation includes SMOTE-ENN coupled with CatBoost optimized through Optuna, achieving a remarkable 88% rate for recall and 82% for the area under the receiver operating characteristic (ROC) curve (AUC) metric.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Strategic Machine Learning Optimization for Cardiovascular Disease Prediction and High-Risk Patient Identification

Abstract

Talk to us

Similar Papers

More From: Algorithms

Lead the way for us

Journal: Algorithms	Publication Date: Apr 26, 2024
License type: CC BY 4.0

Similar Papers

Computational Model for Prediction of Malignant Mesothelioma Diagnosis
Surbhi Gupta ... Manoj Kumar Gupta
The Computer Journal | VOL. 66
Surbhi Gupta, et. al.Surbhi Gupta ... Manoj Kumar Gupta
09 Oct 2021
The Computer Journal | VOL. 66

Applying machine learning methods to predict geology using soil sample geochemistry
Timothy C.C Lui ... Sharon A Cowling
Applied Computing and Geosciences | VOL. 16
Timothy C.C Lui, et. al.Timothy C.C Lui ... Sharon A Cowling
11 Aug 2022
Applied Computing and Geosciences | VOL. 16

Joint modeling strategy for using electronic medical records data to build machine learning models: an example of intracerebral hemorrhage
Jianxiang Tang ... Hongli Wan
BMC Medical Informatics and Decision Making | VOL. 22
Jianxiang Tang, et. al.Jianxiang Tang ... Hongli Wan
25 Oct 2022
BMC Medical Informatics and Decision Making | VOL. 22

A two-stage modeling approach for breast cancer survivability prediction
Zahra Sedighi-Maman ... Alexa Mondello
International Journal of Medical Informatics | VOL. 149
Zahra Sedighi-Maman, et. al.Zahra Sedighi-Maman ... Alexa Mondello
11 Mar 2021
International Journal of Medical Informatics | VOL. 149

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Strategic Machine Learning Optimization for Cardiovascular Disease Prediction and High-Risk Patient Identification

Abstract

Talk to us

Similar Papers

More From: Algorithms