Tailoring Risk Prediction Models to Local Populations.

Aniket N Zinzuwadia,Olga Mineeva,Chunying Li,Zareen Farukhi,Franco Giulianini,Brian Cade,Lin Chen,Elizabeth Karlson,Nina Paynter,Samia Mora,Olga Demler

doi:10.1001/jamacardio.2024.2912

Abstract

Risk estimation is an integral part of cardiovascular care. Local recalibration of guideline-recommended models could address the limitations of existing tools. To provide a machine learning (ML) approach to augment the performance of the American Heart Association's Predicting Risk of Cardiovascular Disease Events (AHA-PREVENT) equations when applied to a local population while preserving clinical interpretability. This cohort study used a New England-based electronic health record cohort of patients without prior atherosclerotic cardiovascular disease (ASCVD) who had the data necessary to calculate the AHA-PREVENT 10-year risk of developing ASCVD in the event period (2007-2016). Patients with prior ASCVD events, death prior to 2007, or age 79 years or older in 2007 were subsequently excluded. The final study population of 95 326 patients was split into 3 nonoverlapping subsets for training, testing, and validation. The AHA-PREVENT model was adapted to this local population using the open-source ML model (MLM) Extreme Gradient Boosting model (XGBoost) with minimal predictor variables, including age, sex, and AHA-PREVENT. The MLM was monotonically constrained to preserve known associations between risk factors and ASCVD risk. Along with sex, race and ethnicity data from the electronic health record were collected to validate the performance of ASCVD risk prediction in subgroups. Data were analyzed from August 2021 to February 2024. Consistent with the AHA-PREVENT model, ASCVD events were defined as the first occurrence of either nonfatal myocardial infarction, coronary artery disease, ischemic stroke, or cardiovascular death. Cardiovascular death was coded via government registries. Discrimination, calibration, and risk reclassification were assessed using the Harrell C index, a modified Hosmer-Lemeshow goodness-of-fit test and calibration curves, and reclassification tables, respectively. In the test set of 38 137 patients (mean [SD] age, 64.8 [6.9] years, 22 708 [59.5]% women and 15 429 [40.5%] men; 935 [2.5%] Asian, 2153 [5.6%] Black, 1414 [3.7%] Hispanic, 31 400 [82.3%] White, and 2235 [5.9%] other, including American Indian, multiple races, unspecified, and unrecorded, consolidated owing to small numbers), MLM-PREVENT had improved calibration (modified Hosmer-Lemeshow P > .05) compared to the AHA-PREVENT model across risk categories in the overall cohort (χ23 = 2.2; P = .53 vs χ23 > 16.3; P < .001) and sex subgroups (men: χ23 = 2.1; P = .55 vs χ23 > 16.3; P < .001; women: χ23 = 6.5; P = .09 vs. χ23 > 16.3; P < .001), while also surpassing a traditional recalibration approach. MLM-PREVENT maintained or improved AHA-PREVENT's calibration in Asian, Black, and White individuals. Both MLM-PREVENT and AHA-PREVENT performed equally well in discriminating risk (approximate ΔC index, ±0.01). Using a clinically significant 7.5% risk threshold, MLM-PREVENT reclassified a total of 11.5% of patients. We visualize the recalibration through MLM-PREVENT ASCVD risk charts that highlight preserved risk associations of the original AHA-PREVENT model. The interpretable ML approach presented in this article enhanced the accuracy of the AHA-PREVENT model when applied to a local population while still preserving the risk associations found by the original model. This method has the potential to recalibrate other established risk tools and is implementable in electronic health record systems for improved cardiovascular risk assessment.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Tailoring Risk Prediction Models to Local Populations.

Abstract

Talk to us

Similar Papers

More From: JAMA cardiology

Lead the way for us

Similar Papers

Application of a Lifestyle-Based Tool to Estimate Premature Cardiovascular Disease Events in Young Adults
Holly C Gooding ... Matthew W Gillman
JAMA internal medicine | VOL. 177
Holly C Gooding, et. al.Holly C Gooding ... Matthew W Gillman
17 Jul 2017
JAMA internal medicine | VOL. 177

Abstract 17106: Prediction of Recurrent Atherosclerotic Cardiovascular Disease Risk Using Machine Learning and Electronic Health Record Data
Ashish Sarraju ... Sukyung Chung
Circulation | VOL. 142
Ashish Sarraju, et. al.Ashish Sarraju ... Sukyung Chung
17 Nov 2020
Circulation | VOL. 142

Longitudinal Plasma Measures of Trimethylamine N-Oxide and Risk of Atherosclerotic Cardiovascular Disease Events in Community-Based Older Adults.
Yujin Lee ... Rozenn N Lemaitre
Journal of the American Heart Association | VOL. 10
Yujin Lee, et. al.Yujin Lee ... Rozenn N Lemaitre
16 Aug 2021
Journal of the American Heart Association | VOL. 10

Machine learning and atherosclerotic cardiovascular disease risk prediction in a multi-ethnic population
Andrew Ward ... Latha Palaniappan
npj Digital Medicine | VOL. 3
Andrew Ward, et. al.Andrew Ward ... Latha Palaniappan
23 Sep 2020
npj Digital Medicine | VOL. 3

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Tailoring Risk Prediction Models to Local Populations.

Abstract

Talk to us

Similar Papers

More From: JAMA cardiology