Mortality Risk Score Prediction in an Elderly Population Using Machine Learning

Sherri Rose

doi:10.1093/aje/kws241

Abstract

Standard practice for prediction often relies on parametric regression methods. Interesting new methods from the machine learning literature have been introduced in epidemiologic studies, such as random forest and neural networks. However, a priori, an investigator will not know which algorithm to select and may wish to try several. Here I apply the super learner, an ensembling machine learning approach that combines multiple algorithms into a single algorithm and returns a prediction function with the best cross-validated mean squared error. Super learning is a generalization of stacking methods. I used super learning in the Study of Physical Performance and Age-Related Changes in Sonomans (SPPARCS) to predict death among 2,066 residents of Sonoma, California, aged 54 years or more during the period 1993-1999. The super learner for predicting death (risk score) improved upon all single algorithms in the collection of algorithms, although its performance was similar to that of several algorithms. Super learner outperformed the worst algorithm (neural networks) by 44% with respect to estimated cross-validated mean squared error and had an R2 value of 0.201. The improvement of super learner over random forest with respect to R2 was approximately 2-fold. Alternatives for risk score prediction include the super learner, which can provide improved performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Mortality Risk Score Prediction in an Elderly Population Using Machine Learning

Abstract

Talk to us

Similar Papers

More From: American Journal of Epidemiology

Lead the way for us

Journal: American Journal of Epidemiology	Publication Date: Jan 29, 2013
Citations: 175

Similar Papers

604 Predicting adverse pregnancy outcomes in women with systemic lupus erythematosus: external validation of the promisse model using multiple independent cohorts
Melissa Fazzari ... Jane Salmon
Lupus Science & Medicine | VOL. 9
Melissa Fazzari, et. al.Melissa Fazzari ... Jane Salmon
01 Dec 2022
Lupus Science & Medicine | VOL. 9

Can Hyperparameter Tuning Improve the Performance of a Super Learner?: A Case Study.
Jenna Wong ... Travis Manderson
Epidemiology | VOL. 30
Jenna Wong, et. al.Jenna Wong ... Travis Manderson
03 Jun 2019
Epidemiology | VOL. 30

Mortality prediction in intensive care units with the Super ICU Learner Algorithm (SICULA): a population-based study
Romain Pirracchio ... Mark J Van Der Laan
The Lancet Respiratory Medicine | VOL. 3
Romain Pirracchio, et. al.Romain Pirracchio ... Mark J Van Der Laan
24 Nov 2014
The Lancet Respiratory Medicine | VOL. 3

Super Learning: An Application to the Prediction of HIV-1 Drug Resistance
Sandra E Sinisi ... Soo-Yon Rhee
Statistical Applications in Genetics and Molecular Biology | VOL. 6
Sandra E Sinisi, et. al.Sandra E Sinisi ... Soo-Yon Rhee
23 Jan 2007
Statistical Applications in Genetics and Molecular Biology | VOL. 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Mortality Risk Score Prediction in an Elderly Population Using Machine Learning

Abstract

Talk to us

Similar Papers

More From: American Journal of Epidemiology