Mean Absolute Prediction Error Research Articles

Prognostic models are becoming increasingly relevant in clinical trials as potential surrogate endpoints, and for patient management as clinical decision support tools. However, the impact of competing risks on model performance remains poorly investigated. We aimed to carefully assess the performance of competing risk and noncompeting risk models in the context of kidney transplantation, where allograft failure and death with a functioning graft are two competing outcomes. We included 11,046 kidney transplant recipients enrolled in 10 countries. We developed prediction models for long-term kidney graft failure prediction, without accounting (i.e., censoring) and accounting for the competing risk of death with a functioning graft, using Cox, Fine-Gray, and cause-specific Cox regression models. To this aim, we followed a detailed and transparent analytical framework for competing and noncompeting risk modelling, and carefully assessed the models' development, stability, discrimination, calibration, overall fit, clinical utility, and generalizability in external validation cohorts and subpopulations. More than 15 metrics were used to provide an exhaustive assessment of model performance. Among 11,046 recipients in the derivation and validation cohorts, 1,497 (14%) lost their graft and 1,003 (9%) died with a functioning graft after a median follow-up post-risk evaluation of 4.7 years (IQR 2.7-7.0). The cumulative incidence of graft loss was similarly estimated by Kaplan-Meier and Aalen-Johansen methods (17% versus 16% in the derivation cohort). Cox and competing risk models showed similar and stable risk estimates for predicting long-term graft failure (average mean absolute prediction error of 0.0140, 0.0138 and 0.0135 for Cox, Fine-Gray, and cause-specific Cox models, respectively). Discrimination and overall fit were comparable in the validation cohorts, with concordance index ranging from 0.76 to 0.87. Across various subpopulations and clinical scenarios, the models performed well and similarly, although in some high-risk groups (such as donors over 65 years old), the findings suggest a trend towards moderately improved calibration when using a competing risk approach. Competing and noncompeting risk models performed similarly in predicting long-term kidney graft failure.

Read full abstract

ObjectiveThe development of clinical prediction models is often impeded by the occurrence of missing values in the predictors. Various methods for imputing missing values before modelling have been proposed. Some of them are based on variants of multiple imputation by chained equations, while others are based on single imputation. These methods may include elements of flexible modelling or machine learning algorithms, and for some of them user-friendly software packages are available. The aim of this study was to investigate by simulation if some of these methods consistently outperform others in performance measures of clinical prediction models. Study Design and SettingWe simulated development and validation cohorts by mimicking observed distributions of predictors and outcome variable of a real data set. In the development cohorts, missing predictor values were created in 36 scenarios defined by the missingness mechanism and proportion of non-complete cases. We applied three imputation algorithms that were available in R software: mice, aregImpute and missForest. These algorithms differed in their use of linear or flexible models, or random forests, the way of sampling from the predictive posterior distribution, and the generation of a single or multiple imputed data sets. For multiple imputation we also investigated the impact of the number of imputations. Logistic regression models were fitted with the simulated development cohorts before (full data analysis) and after missing value generation (complete case analysis), and with the imputed data. Prognostic model performance was measured by the scaled Brier score, c-statistic, calibration intercept and slope, and by the mean absolute prediction error evaluated in validation cohorts without missing values. Performance of full data analysis was considered as ideal. ResultsNone of the imputation methods achieved the model’s predictive accuracy that would be obtained in case of no missingness. In general, complete case analysis yielded the worst performance, and deviation from ideal performance increased with increasing percentage of missingness and decreasing sample size. Across all scenarios and performance measures, aregImpute and mice, both with 100 imputations, resulted in highest predictive accuracy. Surprisingly aregImpute outperformed full data analysis in achieving calibration slopes very close to 1 across all scenarios and outcome models. The increase of mice’s performance with 100 compared to 5 imputations was only marginal. The differences between the imputation methods decreased with increasing sample sizes and decreasing proportion of non-complete cases. ConclusionIn our simulation study, model calibration was more affected by the choice of the imputation method than model discrimination. While differences in model performance after using imputation methods were generally small, multiple imputation methods as mice and aregImpute that can handle linear or nonlinear associations between predictors and outcome are an attractive and reliable choice in most situations.

Read full abstract

Mean Absolute Prediction Error Research Articles

Related Topics

Articles published on Mean Absolute Prediction Error

Automatic Landmark Detection for Preoperative Planning of High Tibial Osteotomy Using Traditional Feature Extraction and Deep Learning Methods.

Competing and Noncompeting Risk Models for Predicting Kidney Allograft Failure.

Refractive outcomes following simultaneous silicone oil removal and ciliary sulcus intraocular lens implantation.

Gradient boosting: A computationally efficient alternative to Markov chain Monte Carlo sampling for fitting large Bayesian spatio-temporal binomial regression models

Comparison of the Accuracy Between the Z CALC2 Calculator and Barrett Toric Calculator in Toric IOL Calculation.

Bayesian impedance deconvolution using timescale distribution for lithium-ion battery state estimation

Effect of Posterior Keratometry and Corneal Radius Ratio on the Accuracy of Intraocular Lens Formulas After Myopic LASIK/PRK.

The performance of prognostic models depended on the choice of missing value imputation algorithm: a simulation study

Prediction of body condition score throughout lactation by random regression test-day models.

Advancing polar motion prediction with derivative information

Data-based modeling of the Pharmacodynamics for the effect of Propofol and Remifentanil during General Anesthesia

A critical review of the use of R2 in risk equalization research.

Accuracy Validation of the New Barrett True Axial Length Formula and the Optimized Lens Factor Using Sum-of-Segment Biometry.

Retrospective Study of Factors Affecting the Accuracy of Predicting Vancomycin Concentrations in Patients Aged 75 Years and Above.

Recording of single-unit activities with flexible micro-electrocorticographic array in rats for decoding of whole-body navigation

Developing Safety Performance Functions for Severe Distraction-Related Crashes along Kentucky’s Rural and Urban Two-Lane Roadways

Accuracy of 4 Different Methods for Estimation of Remaining Growth and Timing of Epiphysiodesis.

Accuracy of intraocular lens power formulas in patients with average keratometry greater than 46 diopters

Fellow eye data for intraocular lens calculation in eyes undergoing combined phacovitrectomy.

Intraocular lens tilt and decentration after primary and delayed implantation in phacovitrectomy for macula-off rhegmatogenous retinal detachment repair.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Mean Absolute Prediction Error Research Articles

Related Topics

Articles published on Mean Absolute Prediction Error

Automatic Landmark Detection for Preoperative Planning of High Tibial Osteotomy Using Traditional Feature Extraction and Deep Learning Methods.

Competing and Noncompeting Risk Models for Predicting Kidney Allograft Failure.

Refractive outcomes following simultaneous silicone oil removal and ciliary sulcus intraocular lens implantation.

Gradient boosting: A computationally efficient alternative to Markov chain Monte Carlo sampling for fitting large Bayesian spatio-temporal binomial regression models

Comparison of the Accuracy Between the Z CALC2 Calculator and Barrett Toric Calculator in Toric IOL Calculation.

Bayesian impedance deconvolution using timescale distribution for lithium-ion battery state estimation

Effect of Posterior Keratometry and Corneal Radius Ratio on the Accuracy of Intraocular Lens Formulas After Myopic LASIK/PRK.

The performance of prognostic models depended on the choice of missing value imputation algorithm: a simulation study

Prediction of body condition score throughout lactation by random regression test-day models.

Advancing polar motion prediction with derivative information

Data-based modeling of the Pharmacodynamics for the effect of Propofol and Remifentanil during General Anesthesia

A critical review of the use of R2 in risk equalization research.

Accuracy Validation of the New Barrett True Axial Length Formula and the Optimized Lens Factor Using Sum-of-Segment Biometry.

Retrospective Study of Factors Affecting the Accuracy of Predicting Vancomycin Concentrations in Patients Aged 75 Years and Above.

Recording of single-unit activities with flexible micro-electrocorticographic array in rats for decoding of whole-body navigation

Developing Safety Performance Functions for Severe Distraction-Related Crashes along Kentucky’s Rural and Urban Two-Lane Roadways

Accuracy of 4 Different Methods for Estimation of Remaining Growth and Timing of Epiphysiodesis.

Accuracy of intraocular lens power formulas in patients with average keratometry greater than 46 diopters

Fellow eye data for intraocular lens calculation in eyes undergoing combined phacovitrectomy.

Intraocular lens tilt and decentration after primary and delayed implantation in phacovitrectomy for macula-off rhegmatogenous retinal detachment repair.