Development, Validation, and Evaluation of a Simple Machine Learning Model to Predict Cirrhosis Mortality

Fasiha Kanwal,Jennifer R Kramer,Allen L Gifford,Hashem B El-Serag,Thomas J Taylor,Aanand D Naik,Donna Smith,Yumei Cao,Steven M Asch

doi:10.1001/jamanetworkopen.2020.23780

Abstract

Machine-learning algorithms offer better predictive accuracy than traditional prognostic models but are too complex and opaque for clinical use. To compare different machine learning methods in predicting overall mortality in cirrhosis and to use machine learning to select easily scored clinical variables for a novel cirrhosis prognostic model. This prognostic study used a retrospective cohort of adult patients with cirrhosis or its complications seen in 130 hospitals and affiliated ambulatory clinics in the integrated, national Veterans Affairs health care system from October 1, 2011, to September 30, 2015. Patients were followed up through December 31, 2018. Data were analyzed from October 1, 2017, to May 31, 2020. Potential predictors included demographic characteristics; liver disease etiology, severity, and complications; use of health care resources; comorbid conditions; and comprehensive laboratory and medication data. Patients were randomly selected for model development (66.7%) and validation (33.3%). Three different statistical and machine learning methods were evaluated: gradient descent boosting, logistic regression with least absolute shrinkage and selection operator (LASSO) regularization, and logistic regression with LASSO constrained to select no more than 10 predictors (partial pathway model). Predictor inclusion and model performance were evaluated in a 5-fold cross-validation. Last, the predictors identified in the most parsimonious (the partial path) model were refit using maximum-likelihood estimation (Cirrhosis Mortality Model [CiMM]), and its predictive performance was compared with that of the widely used Model for End Stage Liver Disease with sodium (MELD-Na) score. All-cause mortality. Of the 107 939 patients with cirrhosis (mean [SD] age, 62.7 [9.6] years; 96.6% male; 66.3% white, 18.4% African American), the annual mortality rate ranged from 8.8% to 15.3%. In total, 32.7% of patients died within 3 years, and 46.2% died within 5 years after the index date. Models predicting 1-year mortality had good discrimination for the gradient descent boosting (area under the receiver operating characteristics curve [AUC], 0.81; 95% CI, 0.80-0.82), logistic regression with LASSO regularization (AUC, 0.78; 95% CI, 0.77-0.79), and the partial path logistic model (AUC, 0.78; 95% CI, 0.76-0.78). All models showed good calibration. The final CiMM model with machine learning-derived clinical variables offered significantly better discrimination than the MELD-Na score, with AUCs of 0.78 (95% CI, 0.77-0.79) vs 0.67 (95% CI, 0.66-0.68) for 1-year mortality, respectively (DeLong z = 17.00; P < .001). In this study, simple machine learning techniques performed as well as the more advanced ensemble gradient boosting. Using the clinical variables identified from simple machine learning in a cirrhosis mortality model produced a new score more transparent than machine learning and more predictive than the MELD-Na score.

Highlights

Risk stratification is at the core of medical practice
The final cirrhosis mortality model (CiMM) model with machine learning–derived clinical variables offered significantly better discrimination than the MELD-Na score, with area under the receiver operating characteristics curve (AUC) of 0.78 vs 0.67 for 1-year mortality, respectively (DeLong z=17.00; P < .001)
Using the clinical variables identified from simple machine learning in a cirrhosis mortality model produced a new score more predictive than the traditional Model for End Stage Liver Disease with sodium

Summary

Introduction

Risk prediction scores routinely guide treatment across a range of medical decisions from anticoagulation[1,2] to lowering of cholesterol levels[3] to life-sustaining intensive care.[4] The most widely used scores include a limited number of measured variables, allowing for transparent calculation and interpretability but constraining their prognostic performance. Machine learning techniques have the potential to improve prognostication These techniques incorporate a large array of predictors in a nonlinear pattern and use multiple interactions to enhance accuracy. Their many variables and the complexity of scoring rules hinder their implementation in all but the most advanced informatics settings. A blended strategy that builds on the strengths of machine learning to develop simpler, clinically explainable risk scores may optimize the trade-off between accuracy vs interpretability and facilitate subsequent implementation

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: JAMA Network Open	Publication Date: Nov 3, 2020
Citations: 52	License type: cc-by

R Discovery Prime

R Discovery Prime

Development, Validation, and Evaluation of a Simple Machine Learning Model to Predict Cirrhosis Mortality

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: JAMA Network Open

Lead the way for us

Similar Papers

Unleashing the Power of Machine Learning to Predict Myocardial Recovery After Left Ventricular Assist Device: A Call for the Inclusion of Unstructured Data Sources in Heart Failure Registries.
Ramsey M Wehbe
Circulation. Heart failure | VOL. 15
Ramsey M WehbeRamsey M Wehbe
24 Dec 2021
Circulation. Heart failure | VOL. 15

Systematic Comparison of Machine Learning Methods for Identification of miRNA Species as Disease Biomarkers
Chihiro Higuchi ... Yukinori Okada
-
Chihiro Higuchi, et. al.Chihiro Higuchi ... Yukinori Okada
01 Jan 2015
01 Jan 2015

Use Machine Learning Methods to Explore the Associated Risk Factors of Osteoporosis or Bone Loss in Chinese People With Type 2 Diabetes：A Clinical Prediction Model
Yaqian Mao ... Bingwei He
SSRN Electronic Journal | VOL. -
Yaqian Mao, et. al.Yaqian Mao ... Bingwei He
11 Nov 2020
SSRN Electronic Journal | VOL. -

Machine Learning Methods Based on CT Features Differentiate G1/G2 From G3 Pancreatic Neuroendocrine Tumors
Hai-Yan Chen ... Guo-Liang Shao
Academic radiology | VOL. 31
Hai-Yan Chen, et. al.Hai-Yan Chen ... Guo-Liang Shao
04 Dec 2023
Academic radiology | VOL. 31

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Development, Validation, and Evaluation of a Simple Machine Learning Model to Predict Cirrhosis Mortality

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: JAMA Network Open