The relative data hungriness of unpenalized and penalized logistic regression and ensemble-based machine learning methods: the case of calibration.

Peter C Austin, Douglas S Lee, Bo Wang

doi:10.1186/s41512-024-00179-z

Abstract

Machine learning methods are increasingly being used to predict clinical outcomes. Optimism is the difference in model performance between derivation and validation samples. The term "data hungriness" refers to the sample size needed for a modelling technique to generate a prediction model with minimal optimism. Our objective was to compare the relative data hungriness of different statistical and machine learning methods when assessed using calibration.

We used Monte Carlo simulations to assess the effect of the number of events per variable (EPV) on the optimism of six learning methods when assessing model calibration: unpenalized logistic regression, ridge regression, lasso regression, bagged classification trees, random forests, and stochastic gradient boosting machines using trees as the base learners. We performed simulations in two large cardiovascular datasets, each of which comprised an independent derivation and validation sample: patients hospitalized with acute myocardial infarction and patients hospitalized with heart failure. We used six data-generating processes, each based on one of the six learning methods. We allowed the sample sizes to be such that the number of EPV ranged from 10 to 200 in increments of 10. We applied six prediction methods in each of the simulated derivation samples and evaluated calibration in the simulated validation samples using the integrated calibration index, the calibration intercept, and the calibration slope. We also examined Nagelkerke's R², the scaled Brier score, and the c-statistic.

Across all 12 scenarios (2 diseases × 6 data-generating processes), penalized logistic regression displayed very low optimism even when the number of EPV was very low. Random forests and bagged trees tended to be the most data hungry and displayed the greatest optimism.

When assessed using calibration, penalized logistic regression was substantially less data hungry than methods from the machine learning literature.
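To make two of the calibration measures named in the abstract concrete, the following is a minimal Python sketch, not the authors' code (the abstract does not state what software they used). It assumes the common definitions: the calibration slope is the coefficient from a logistic regression of the observed outcome on the logit of the predicted probability, and the calibration intercept is the intercept of the same model with the slope fixed at 1. All function names, the Newton-Raphson fitting routine, and the simulated data are illustrative assumptions. The sign convention in `optimism` (derivation minus validation) is also an assumption. The integrated calibration index is omitted because it needs a smoother.

```python
import math
import random


def expit(x):
    """Numerically stable inverse logit."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def logit(p, eps=1e-12):
    """Log-odds, with probabilities clipped away from 0 and 1."""
    p = min(max(p, eps), 1.0 - eps)
    return math.log(p / (1.0 - p))


def events_per_variable(n_events, n_predictors):
    """EPV: number of outcome events divided by number of candidate predictors."""
    return n_events / n_predictors


def optimism(derivation_value, validation_value):
    """Difference in a performance measure between derivation and validation
    samples (sign convention assumed here: derivation minus validation)."""
    return derivation_value - validation_value


def calibration_intercept(y, p, iters=25):
    """Intercept of logit(P(y=1)) = a + logit(p), slope fixed at 1 (offset).
    Fitted by one-dimensional Newton-Raphson."""
    lp = [logit(pi) for pi in p]
    a = 0.0
    for _ in range(iters):
        mu = [expit(a + x) for x in lp]
        score = sum(yi - mi for yi, mi in zip(y, mu))
        info = sum(mi * (1.0 - mi) for mi in mu)
        a += score / info
    return a


def calibration_slope(y, p, iters=25):
    """Slope b of logit(P(y=1)) = a + b * logit(p), by Newton-Raphson."""
    lp = [logit(pi) for pi in p]
    a, b = 0.0, 0.0
    for _ in range(iters):
        mu = [expit(a + b * x) for x in lp]
        w = [mi * (1.0 - mi) for mi in mu]
        g0 = sum(yi - mi for yi, mi in zip(y, mu))
        g1 = sum((yi - mi) * x for yi, mi, x in zip(y, mu, lp))
        h00 = sum(w)
        h01 = sum(wi * x for wi, x in zip(w, lp))
        h11 = sum(wi * x * x for wi, x in zip(w, lp))
        det = h00 * h11 - h01 * h01
        # Newton step: solve the 2x2 system H * delta = gradient.
        a += (h11 * g0 - h01 * g1) / det
        b += (h00 * g1 - h01 * g0) / det
    return b


def simulate_overfit(n, seed=0):
    """Hypothetical validation sample: outcomes follow the true linear
    predictor, while predictions are too extreme (logit doubled)."""
    rng = random.Random(seed)
    y, p = [], []
    for _ in range(n):
        true_lp = rng.gauss(0.0, 1.0)
        y.append(1 if rng.random() < expit(true_lp) else 0)
        p.append(expit(2.0 * true_lp))
    return y, p


if __name__ == "__main__":
    y, p = simulate_overfit(5000, seed=1)
    print("calibration slope:", calibration_slope(y, p))
    print("calibration intercept:", calibration_intercept(y, p))
```

In this simulated setting the predictions are deliberately too extreme, so the calibration slope should come out well below 1, which is the typical signature of an overfitted model evaluated in a validation sample. A slope near 1 and an intercept near 0 would indicate good calibration.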

Journal: Diagnostic and prognostic research
Publication Date: Nov 5, 2024
License type: CC BY-NC-ND 4.0

Similar Papers

Predictive performance of machine and statistical learning methods: Impact of data-generating processes on external validity in the "large N, small p" setting.
Peter C Austin ... Ewout W Steyerberg
Statistical Methods in Medical Research | VOL. 30
13 Apr 2021

Empirical analyses and simulations showed that different machine and statistical learning methods had differing performance for predicting blood pressure
Peter C Austin ... Ewout W Steyerberg
Scientific Reports | VOL. 12
03 Jun 2022

Editor's evaluation: Derivation and external validation of clinical prediction rules identifying children at risk of linear growth faltering
Eduardo Franco
05 Sep 2022

Decision letter: Derivation and external validation of clinical prediction rules identifying children at risk of linear growth faltering
Andrew N Mertens ... Eduardo Franco
05 Sep 2022
