Performance Of Clinical Prediction Models Research Articles

Aim: Clinical prediction models need to be validated. In this study, we used simulated data to compare various internal and external validation approaches.

Methods: Data of 500 patients were simulated using the distributions of metabolic tumor volume, standardized uptake value, the maximal distance between the largest lesion and another lesion, WHO performance status and age of 296 diffuse large B cell lymphoma patients. These data were used to predict progression after 2 years based on an existing logistic regression model. Using the simulated data, we applied cross-validation, bootstrapping and holdout (n = 100). We simulated new external datasets (n = 100, n = 200, n = 500) and simulated stage-specific external datasets (1), varied the cut-off for high-risk patients (2) and the false-positive and false-negative rates (3), and simulated a dataset with EARL2 characteristics (4). All internal and external simulations were repeated 100 times. Model performance was expressed as the cross-validated area under the curve (CV-AUC ± SD) and calibration slope.

Results: Cross-validation (0.71 ± 0.06) and holdout (0.70 ± 0.07) resulted in comparable model performance, but the holdout estimate had higher uncertainty. Bootstrapping resulted in a CV-AUC of 0.67 ± 0.02. The calibration slope was comparable for these internal validation approaches. Increasing the size of the test set resulted in more precise CV-AUC estimates and a smaller SD for the calibration slope. For test datasets with different stages, the CV-AUC increased as Ann Arbor stage increased. As expected, changing the cut-off for high risk and the false-positive and false-negative rates influenced model performance, which is clearly shown by the low calibration slope. The EARL2 dataset resulted in similar model performance and precision, but the calibration slope indicated overfitting.

Conclusion: In the case of small datasets, it is not advisable to use a holdout or a very small external dataset with similar characteristics. A single small testing dataset suffers from large uncertainty. Therefore, repeated CV using the full training dataset is preferred instead. Our simulations also demonstrated that it is important to consider the impact of differences in patient population between training and test data, which may call for adjustment or stratification of relevant variables.
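The finding that larger test sets give more precise AUC and calibration-slope estimates can be illustrated with a minimal sketch. It is not the authors' simulation: the single-predictor logistic model, its coefficients (-1.0 and 0.8), the seed and all function names are assumptions chosen only for illustration, and the PET-derived predictors from the study are not reproduced. The sketch applies a fixed, known model to repeatedly simulated test sets of 100, 200 and 500 patients and reports the mean and SD of both metrics.

```python
import math
import random
import statistics


def auc(scores, labels):
    """Rank-based AUC: chance that a random event scores above a random non-event (ties count 0.5)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def calibration_slope(lp, labels, iters=15):
    """Slope b in logit(P(y=1)) = a + b * lp, fitted by Newton-Raphson."""
    a, b = 0.0, 1.0
    for _ in range(iters):
        g0 = g1 = h00 = h01 = h11 = 0.0
        for x, y in zip(lp, labels):
            p = 1.0 / (1.0 + math.exp(-(a + b * x)))
            w = p * (1.0 - p)
            g0 += y - p            # gradient w.r.t. intercept
            g1 += (y - p) * x      # gradient w.r.t. slope
            h00 += w               # information matrix entries
            h01 += w * x
            h11 += w * x * x
        det = h00 * h11 - h01 * h01
        a += (h11 * g0 - h01 * g1) / det
        b += (h00 * g1 - h01 * g0) / det
    return b


def simulate(n, rng):
    """Draw n patients from a hypothetical model: lp = -1.0 + 0.8 * x, x ~ N(0, 1)."""
    lp = [-1.0 + 0.8 * rng.gauss(0, 1) for _ in range(n)]
    y = [1 if rng.random() < 1.0 / (1.0 + math.exp(-v)) else 0 for v in lp]
    return lp, y


def repeat_validation(n, reps=100, seed=1):
    """Evaluate the fixed model on `reps` independent test sets of size n."""
    rng = random.Random(seed)
    aucs, slopes = [], []
    for _ in range(reps):
        lp, y = simulate(n, rng)
        aucs.append(auc(lp, y))
        slopes.append(calibration_slope(lp, y))
    return aucs, slopes


results = {}
for n in (100, 200, 500):
    aucs, slopes = repeat_validation(n)
    results[n] = (statistics.mean(aucs), statistics.stdev(aucs),
                  statistics.mean(slopes), statistics.stdev(slopes))
    print(f"n={n}: AUC {results[n][0]:.2f} ± {results[n][1]:.2f}, "
          f"slope {results[n][2]:.2f} ± {results[n][3]:.2f}")
```

Because the test data here are generated from the same model that is being evaluated, the calibration slope should centre near 1; a slope well below 1 on real external data would point to overfitting or a shift in case mix.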

Summary

Background: Tuberculosis (TB) clinical prediction rules rely on the presence of symptoms; however, many undiagnosed cases in the community are asymptomatic. This study aimed to explore the utility of clinical factors in predicting TB among people with HIV not seeking care.

Methods: Baseline data were analysed from an observational cohort of ambulant adults with HIV in South Africa. Participants were tested for Mycobacterium tuberculosis (Mtb) sensitisation (interferon-γ release assay, IGRA) and microbiologically-confirmed prevalent pulmonary TB disease at baseline, and actively surveilled for incident TB through 15 months. Multivariable LASSO regression with post-selection inference was used to test associations with Mtb sensitisation and TB disease.

Findings: Between March 22, 2017, and May 15, 2018, 861 participants were enrolled. Among 851 participants included in the analysis, 94·5% were asymptomatic and 45·9% sensitised to Mtb. TB prevalence was 2·0% at baseline and incidence 2·3/100 person-years through 15 months follow-up. Study site was associated with baseline Mtb sensitisation (p < 0·001), prevalent (p < 0·001), and incident TB disease (p = 0·037). Independent of site, higher CD4 counts (per 50 cells/mm3, aOR 1·48, 95%CI 1·12–1·77, p = 0·006) were associated with increased IGRA positivity, and participants without TB disease (aOR 0·80, 95%CI 0·69–0·94, p = 0·006) had reduced IGRA positivity; no variables were independently associated with prevalent TB. Mixed ancestry (aHR 1·49, 95%CI 1·30–>1000, p = 0·005) and antiretroviral initiation (aHR 1·48, 95%CI 1·01–929·93, p = 0·023) were independently associated with incident TB. Models incorporating clinical features alone performed poorly in diagnosing prevalent (AUC 0·65, 95%CI 0·44–0·85) or predicting progression to incident (0·67, 0·46–0·88) TB.

Interpretation: CD4 count and antiretroviral initiation, proxies for immune status and HIV stage, were associated with Mtb sensitisation and TB disease. Inadequate performance of clinical prediction models may reflect predominantly subclinical disease diagnosed in this setting and unmeasured local site factors affecting transmission and progression.

Funding: The CORTIS-HR study was funded by the Bill & Melinda Gates Foundation (OPP1151915) and by the Strategic Health Innovation Partnerships Unit of the South African Medical Research Council with funds received from the South African Department of Science and Technology. The regulatory sponsor was the University of Cape Town.

Articles published on Performance Of Clinical Prediction Models

Impact of random oversampling and random undersampling on the performance of prediction models developed using observational health data

Follow-up ASPECTS improves prediction of potentially lethal malignant edema in patients with large middle cerebral artery stroke

Dynamic updating of clinical survival prediction models in a changing environment

Imputation and missing indicators for handling missing data in the development and deployment of clinical prediction models: A simulation study.

EHR foundation models improve robustness in the presence of temporal distribution shift

Prognosis prediction performs better in patients with non-cirrhosis hepatitis B virus-related acute-on-chronic liver failure than those with cirrhosis.

External validation: a simulation study to compare cross-validation versus holdout or external testing to assess the performance of clinical prediction models using PET data from DLBCL patients

The thrombodynamic ratio as a predictor of 28-day mortality in sepsis patients

Clinical predictors of pulmonary tuberculosis among South African adults with HIV.

Updating Clinical Prediction Models: An Illustrative Case Study.

ROC curves for clinical prediction models part 1. ROC plots showed no added value above the AUC when evaluating the performance of clinical prediction models

ROC curves for clinical prediction models part 3. The ROC plot: a picture that needs a 1000 words

Impact of predictor measurement heterogeneity across settings on the performance of prediction models: A measurement error perspective.

Poor performance of clinical prediction models: the harm of commonly applied methods

A tutorial on variable selection for clinical prediction models: feature selection methods in data mining could improve the results
