Clinical Risk Prediction Research Articles

Machine learning has been used to analyse heart failure subtypes, but not across large, distinct, population-based datasets, across the whole spectrum of causes and presentations, or with clinical and non-clinical validation by different machine learning methods. Using our published framework, we aimed to discover heart failure subtypes and validate them in population-representative data.

In this external, prognostic, and genetic validation study, we analysed individuals aged 30 years or older with incident heart failure from two population-based databases in the UK (Clinical Practice Research Datalink [CPRD] and The Health Improvement Network [THIN]) from 1998 to 2018. Pre-heart failure and post-heart failure factors (n=645) included demographic information, history, examination, blood laboratory values, and medications. We identified subtypes using four unsupervised machine learning methods (K-means, hierarchical, K-Medoids, and mixture model clustering) with 87 of 645 factors in each dataset. We evaluated subtypes for (1) external validity (across datasets); (2) prognostic validity (predictive accuracy for 1-year mortality); and (3) genetic validity (UK Biobank), by association with polygenic risk scores (PRS) for heart failure-related traits (n=11) and with single nucleotide polymorphisms (n=12).

We included 188 800, 124 262, and 9573 individuals with incident heart failure from CPRD, THIN, and UK Biobank, respectively, between Jan 1, 1998, and Jan 1, 2018. After identifying five clusters, we labelled heart failure subtypes as (1) early onset, (2) late onset, (3) atrial fibrillation related, (4) metabolic, and (5) cardiometabolic. In the external validity analysis, subtypes were similar across datasets (c-statistics: THIN model in CPRD ranged from 0·79 [subtype 3] to 0·94 [subtype 1], and CPRD model in THIN ranged from 0·79 [subtype 1] to 0·92 [subtypes 2 and 5]). In the prognostic validity analysis, 1-year all-cause mortality after heart failure diagnosis (subtype 1 0·20 [95% CI 0·14-0·25], subtype 2 0·46 [0·43-0·49], subtype 3 0·61 [0·57-0·64], subtype 4 0·11 [0·07-0·16], and subtype 5 0·37 [0·32-0·41]) differed across subtypes in CPRD and THIN data, as did risk of non-fatal cardiovascular diseases and all-cause hospitalisation. In the genetic validity analysis, the atrial fibrillation-related subtype showed associations with the related PRS. Late onset and cardiometabolic subtypes were the most similar and were strongly associated with PRS for hypertension, myocardial infarction, and obesity (p<0·0009). We developed a prototype app for routine clinical use, which could enable evaluation of effectiveness and cost-effectiveness.

Across four methods and three datasets, including genetic data, in the largest study of incident heart failure to date, we identified five machine learning-informed subtypes, which might inform aetiological research, clinical risk prediction, and the design of heart failure trials.

Funding: European Union Innovative Medicines Initiative-2.
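
The external validity results in this abstract are reported as c-statistics. As a rough illustration of what that number measures, the sketch below computes a c-statistic as the share of (member, non-member) pairs in which the member receives the higher predicted score, with ties counted as half. The function name `c_statistic` and the toy scores and labels are hypothetical and are not taken from the study; the abstract does not describe how the authors computed their values.

```python
from itertools import product


def c_statistic(scores, labels):
    """Concordance probability for a binary label (1 = in subtype, 0 = not).

    Counts the fraction of positive/negative pairs where the positive case
    has the higher score; tied scores contribute 0.5.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    concordant = 0.0
    for p, n in product(pos, neg):
        if p > n:
            concordant += 1.0
        elif p == n:
            concordant += 0.5
    return concordant / (len(pos) * len(neg))


# Hypothetical toy data (an assumption for illustration only):
# labels mark membership of one subtype in the target dataset,
# scores are the membership probabilities from a model trained elsewhere.
labels = [1, 1, 1, 0, 0, 0, 0, 1]
scores = [0.9, 0.8, 0.4, 0.3, 0.5, 0.2, 0.1, 0.7]

print(c_statistic(scores, labels))
```

A value of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation, which is the scale on which the reported 0·79 to 0·94 range should be read.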

Introduction: The prevalence of end-stage renal disease has raised the need for renal replacement therapy over recent decades. Even though a kidney transplant offers an improved quality of life and lower cost of care than dialysis, graft failure is possible after transplantation. Hence, this study aimed to predict the risk of graft failure among post-transplant recipients in Ethiopia using selected machine learning prediction models.

Methodology: The data were extracted from the retrospective cohort of kidney transplant recipients at the Ethiopian National Kidney Transplantation Center from September 2015 to February 2022. In response to the imbalanced nature of the data, we performed hyperparameter tuning, probability threshold moving, tree-based ensemble learning, stacking ensemble learning, and probability calibration to improve the prediction results. Probabilistic models (logistic regression, naive Bayes, and artificial neural network) and tree-based ensemble models (random forest, bagged tree, and stochastic gradient boosting), selected on merit, were applied. Models were compared in terms of discrimination and calibration performance. The best-performing model was then used to predict the risk of graft failure.

Results: A total of 278 complete cases were analyzed, with 21 graft failures and 3 events per predictor. Of these, 74.8% were male and 25.2% were female, with a median age of 37 years. Among the individual models, the bagged tree and random forest had the top, and equal, discrimination performance (AUC-ROC = 0.84), whereas the random forest had the best calibration performance (Brier score = 0.045). When each individual model was tested as a meta-learner for stacking ensemble learning, stochastic gradient boosting had the top discrimination (AUC-ROC = 0.88) and calibration (Brier score = 0.048) performance. Regarding feature importance, chronic rejection, blood urea nitrogen, number of post-transplant admissions, phosphorus level, acute rejection, and urological complications were the top predictors of graft failure.

Conclusions: Bagging, boosting, and stacking, with probability calibration, are good choices for clinical risk prediction on imbalanced data. A data-driven probability threshold is more beneficial than the natural threshold of 0.5 for improving prediction results from imbalanced data. Integrating various techniques in a systematic framework is a sound strategy for improving prediction results from imbalanced data. Clinical experts in kidney transplantation are recommended to use the final calibrated model as a decision support system to predict the risk of graft failure for individual patients.
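
Two ideas from this abstract, the Brier score and a data-driven probability threshold, can be shown in a few lines. The sketch below is a minimal illustration, assuming Youden's J (sensitivity + specificity - 1) as the criterion for choosing the threshold; the abstract does not say which criterion the authors used. The function names and the ten toy predictions are hypothetical and are not study data.

```python
def brier_score(probs, outcomes):
    """Mean squared difference between predicted probability and 0/1 outcome."""
    return sum((p - y) ** 2 for p, y in zip(probs, outcomes)) / len(probs)


def youden_threshold(probs, outcomes):
    """Pick the cut-off maximising sensitivity + specificity - 1.

    Candidate cut-offs are the distinct predicted probabilities; a case is
    classified positive when its probability is >= the cut-off.
    Assumes both outcome classes are present.
    """
    n_pos = sum(outcomes)
    n_neg = len(outcomes) - n_pos
    best_t, best_j = 0.5, float("-inf")
    for t in sorted(set(probs)):
        tp = sum(1 for p, y in zip(probs, outcomes) if p >= t and y == 1)
        tn = sum(1 for p, y in zip(probs, outcomes) if p < t and y == 0)
        j = tp / n_pos + tn / n_neg - 1
        if j > best_j:
            best_t, best_j = t, j
    return best_t, best_j


# Hypothetical imbalanced toy data: 10 recipients, 2 graft failures (1 = failure).
outcomes = [0, 0, 0, 0, 0, 0, 0, 0, 1, 1]
probs = [0.02, 0.05, 0.05, 0.08, 0.10, 0.12, 0.15, 0.30, 0.25, 0.45]

threshold, j_value = youden_threshold(probs, outcomes)
print("Brier score:", brier_score(probs, outcomes))
print("Chosen threshold:", threshold, "Youden's J:", j_value)
```

In this toy set no predicted probability reaches 0.5, so the natural threshold would flag no failures at all, while the data-driven cut-off sits well below it. That is the kind of effect the conclusions describe for rare outcomes.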

Articles published on Clinical Risk Prediction

A Systematic Analysis of the Clinical Outcome Associated with Multiple Reclassified Desmosomal Gene Variants in Arrhythmogenic Right Ventricular Cardiomyopathy Patients

Multi-Omic Biomarkers Improve Indeterminate Pulmonary Nodule Malignancy Risk Assessment.

Utility of Preoperative N-Terminal Pro-B-Type Natriuretic Peptide in the Prognosis of Coronary Artery Bypass Grafting

Polygenic Risk Scores and Their Phenotypic Adjustment: Novel Biomarkers in Cardiovascular Disease Prevention

Advances in positron emission tomography and radiomics.

Radiation-sensitive genetic prognostic model to identify individuals at risk for radiation resistance in head and neck squamous cell carcinoma.

Advancing heart failure research using machine learning

Identifying subtypes of heart failure from three electronic health record sources with machine learning: an external, prognostic, and genetic validation study

Development of Clinical Risk-prediction Models for Uterine Atony Following Vaginal and Cesarean Delivery

Classification of imbalanced data using machine learning algorithms to predict the risk of renal graft failures in Ethiopia

Comparison of likelihood penalization and variance decomposition approaches for clinical prediction models: A simulation study.

Development and validation of a multivariable risk prediction model for identifying ketosis-prone type 2 diabetes.

A clinical risk prediction tool for identifying the risk of violent offending in severe mental illness: A retrospective case-control study

Can change in phase angle predict the risk of morbidity and mortality during an 18-year follow-up period? A cohort study among adults.

Pandemic lockdown, isolation, and exit policies based on machine learning predictions.

The early use of Antibiotics for At-risk children with InfluEnza in Primary Care (the ARCHIE programme)

A machine learning approach to predicting 30-day mortality following paediatric cardiac surgery: findings from the Australia New Zealand Congenital Outcomes Registry for Surgery (ANZCORS).

A novel computer based risk prediction model for vocal cord palsy before thyroidectomy

Risk Prediction Models for Ischemic Cardiovascular Outcomes in Patients with Acute Coronary Syndrome.

Development of transdiagnostic clinical risk prediction models for 12-month onset and course of eating disorders among adolescents in the community.
