Evaluating disease prediction models using a cohort whose covariate distribution differs from that of the target population.

Scott Powers, Leslie Bernstein, Alice S Whittemore, Valerie Mcguire, Alison J Canchola

doi:10.1177/0962280217723945

Abstract

Personal predictive models for disease development play important roles in chronic disease prevention. The performance of these models is evaluated by applying them to the baseline covariates of participants in external cohort studies and comparing model predictions to the subjects' subsequent disease incidence. However, the covariate distribution among participants in a validation cohort may differ from that of the population for which the model will be used. Since estimates of a predictive model's performance depend on the distribution of covariates among the subjects to whom the model is applied, such differences can cause misleading estimates of model performance in the target population. We propose a method for addressing this problem by weighting the cohort subjects to make their covariate distribution better match that of the target population. Simulations show that the method provides accurate estimates of model performance in the target population, while unweighted estimates may not. We illustrate the method by applying it to evaluate an ovarian cancer prediction model targeted to US women, using cohort data from participants in the California Teachers Study. The methods are implemented in open-source code, available for public use as the R package RMAP (Risk Model Assessment Package) at http://stanford.edu/~ggong/rmap/ .
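The weighting idea in the abstract can be illustrated with a minimal sketch. The abstract does not say how the weights are computed, so this example assumes simple post-stratification on a single categorical covariate: each cohort subject is weighted by the target population's share of their stratum divided by the cohort's share. The function names, strata, risks, and outcomes below are all hypothetical, and this is not the RMAP interface or necessarily the authors' estimator.

```python
from collections import Counter


def poststratification_weights(cohort_strata, target_proportions):
    """Return one weight per subject: target share / cohort share of the stratum.

    Assumes every cohort stratum appears in target_proportions.
    """
    n = len(cohort_strata)
    counts = Counter(cohort_strata)
    return [target_proportions[s] / (counts[s] / n) for s in cohort_strata]


def weighted_mean(values, weights):
    """Weighted average of values."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)


# Hypothetical toy cohort: (stratum, model-predicted risk, observed outcome).
# The cohort over-represents the "low" stratum (80%) relative to an assumed
# target population that is split 50/50.
cohort = (
    [("low", 0.1, 1)] + [("low", 0.1, 0)] * 7
    + [("high", 0.4, 1), ("high", 0.4, 0)]
)
target_proportions = {"low": 0.5, "high": 0.5}  # assumed, for illustration

strata = [s for s, _, _ in cohort]
predicted = [p for _, p, _ in cohort]
observed = [y for _, _, y in cohort]

weights = poststratification_weights(strata, target_proportions)
unit = [1.0] * len(cohort)

# Mean predicted risk and observed incidence, without and with weighting.
unweighted_predicted = weighted_mean(predicted, unit)
unweighted_observed = weighted_mean(observed, unit)
weighted_predicted = weighted_mean(predicted, weights)
weighted_observed = weighted_mean(observed, weights)

print("unweighted: predicted", unweighted_predicted, "observed", unweighted_observed)
print("weighted:   predicted", weighted_predicted, "observed", weighted_observed)
```

In this toy data the unweighted and weighted summaries differ because the high-risk stratum is under-represented in the cohort; with real data, strata would typically be defined by several covariates, and the weights would need variance estimates that this sketch omits.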

Journal: Statistical Methods in Medical Research
Publication Date: Aug 16, 2017
Citations: 9

Similar Papers

Prognostic models for mortality risk in patients requiring ECMO
Lara C A Pladet ... Dirk W Donker
Intensive Care Medicine | VOL. 49
04 Jan 2023

Making better Maxent models of species distributions: complexity, overfitting and evaluation
Aleksandar Radosavljevic ... Robert P Anderson
Journal of Biogeography | VOL. 41
06 Dec 2013

A Bayesian Approach to Recreational Water Quality Model Validation and Comparison in the Presence of Measurement Error
E Potash ... S Steinschneider
Water Resources Research | VOL. 58
01 Jan 2021

On Splitting Training and Validation Set: A Comparative Study of Cross-Validation, Bootstrap and Systematic Sampling for Estimating the Generalization Performance of Supervised Learning
Yun Xu ... Royston Goodacre
Journal of Analysis and Testing | VOL. 2
01 Jul 2018
