The roles of predictors in cardiovascular risk models - a question of modeling culture?

Christine Wallisch, Georg Heinze, Matthias Samwald, Daniela Dunkler, Georg Dorffner, Asan Agibetov, Maria Haller

doi:10.1186/s12874-021-01487-4

Abstract

Background: While machine learning (ML) algorithms may predict cardiovascular outcomes more accurately than statistical models, their result is usually not representable by a transparent formula. Hence, it is often unclear how specific values of predictors lead to the predictions. We aimed to demonstrate with graphical tools how predictor-risk relations in cardiovascular risk prediction models fitted by ML algorithms and by statistical approaches may differ, and how sample size affects the stability of the estimated relations.

Methods: We reanalyzed data from a large registry of 1.5 million participants in a national health screening program. Three data analysts developed analytical strategies to predict cardiovascular events within 1 year from health screening. This was done for the full data set and with gradually reduced sample sizes, and each data analyst followed their favorite modeling approach. Predictor-risk relations were visualized by partial dependence and individual conditional expectation plots.

Results: When comparing the modeling algorithms, we found some similarities between these visualizations but also occasional divergence. The smaller the sample size, the more the predictor-risk relation depended on the modeling algorithm used, and sampling variability also played an increased role. Predictive performance was similar if the models were derived on the full data set, whereas smaller sample sizes favored simpler models.

Conclusion: Predictor-risk relations from ML models may differ from those obtained by statistical models, even with large sample sizes. Hence, predictors may assume different roles in risk prediction models. As long as sample size is sufficient, predictive accuracy is not largely affected by the choice of algorithm.

Highlights

While machine learning (ML) algorithms may predict cardiovascular outcomes more accurately than statistical models, their result is usually not representable by a transparent formula.
Cardiovascular disease (CVD) risk prediction models in use, such as the Framingham 2008 CVD risk model, were statistically estimated by fitting a Cox model with a relatively small number of coefficients [1].
Similar to the Framingham 2008 CVD risk model, we considered the following predictors: sex, age, total cholesterol, high-density lipoprotein (HDL) cholesterol, systolic blood pressure (BP, mmHg), hypertensive drug intake, diabetes, and smoking status.

Summary

Introduction

While machine learning (ML) algorithms may predict cardiovascular outcomes more accurately than statistical models, their result is usually not representable by a transparent formula. It is therefore often unclear how specific values of predictors lead to the predictions. An important caveat of many ML algorithms is that the final model structure is non-transparent, so that predictions seem to be generated by a ‘black box’. This impedes reproducibility as well as the quantification of a particular predictor-risk relation. Several techniques have been proposed to make such predictions interpretable [9, 10], and some of them have been denoted as ‘model-agnostic’ because they can be applied without knowing how a modeling algorithm arrives at its predictions.
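
The abstract names partial dependence and individual conditional expectation (ICE) plots as the graphical tools used. As a minimal sketch of why such tools count as model-agnostic, the Python code below computes both curves using nothing but a prediction function. The function `toy_risk`, its coefficients, and the simulated age and systolic BP values are hypothetical stand-ins chosen for illustration; they are not the models or data of the paper.

```python
import numpy as np

def ice_curves(predict, X, feature, grid):
    """ICE: one curve per individual, obtained by setting column `feature`
    to each grid value in turn and re-predicting, all else unchanged."""
    X = np.asarray(X, dtype=float)
    curves = np.empty((X.shape[0], len(grid)))
    for j, value in enumerate(grid):
        X_mod = X.copy()
        X_mod[:, feature] = value      # same predictor value for every individual
        curves[:, j] = predict(X_mod)  # only the prediction function is needed
    return curves

def partial_dependence(predict, X, feature, grid):
    """Partial dependence: the average of the ICE curves at each grid value."""
    return ice_curves(predict, X, feature, grid).mean(axis=0)

def toy_risk(X):
    """Hypothetical black-box risk model (illustrative coefficients only).
    Columns: 0 = age in years, 1 = systolic BP in mmHg."""
    age, sbp = X[:, 0], X[:, 1]
    linear = -9.0 + 0.06 * age + 0.02 * sbp + 0.0002 * (age - 50) * (sbp - 120)
    return 1.0 / (1.0 + np.exp(-linear))

# Simulated individuals (not real registry data)
rng = np.random.default_rng(0)
X = np.column_stack([rng.uniform(40, 80, 200), rng.uniform(100, 180, 200)])

age_grid = np.linspace(40, 80, 9)
ice = ice_curves(toy_risk, X, feature=0, grid=age_grid)
pd_curve = partial_dependence(toy_risk, X, feature=0, grid=age_grid)
```

Plotting each row of `ice` against `age_grid` gives the ICE plot, and overlaying `pd_curve` gives the partial dependence plot. Because only `predict` is called, the same two functions could in principle be applied to a Cox model or to an ML model alike, which is the sense in which the comparison in the abstract is possible.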



Journal: BMC Medical Research Methodology
Publication Date: Dec 1, 2021
Citations: 4
License type: open-access
