Prediction models for clustered data: comparison of a random intercept and standard regression model

Walter Bouwmeester, Jos WR Twisk, Karel GM Moons, Yvonne Vergouwe, Wilton A Van Klei, Teus H Kappen

doi:10.1186/1471-2288-13-19

Abstract

Background: When study data are clustered, standard regression analysis is considered inappropriate and analytical techniques for clustered data need to be used. For prediction research in which the interest of predictor effects is on the patient level, random effect regression models are probably preferred over standard regression analysis. It is well known that the random effect parameter estimates and the standard logistic regression parameter estimates are different. Here, we compared random effect and standard logistic regression models for their ability to provide accurate predictions.

Methods: Using an empirical study on 1642 surgical patients at risk of postoperative nausea and vomiting, who were treated by one of 19 anesthesiologists (clusters), we developed prognostic models either with standard or random intercept logistic regression. External validity of these models was assessed in new patients from other anesthesiologists. We supported our results with simulation studies using intra-class correlation coefficients (ICC) of 5%, 15%, or 30%. Standard performance measures and measures adapted for the clustered data structure were estimated.

Results: The model developed with random effect analysis showed better discrimination than the standard approach, if the cluster effects were used for risk prediction (standard c-index of 0.69 versus 0.66). In the external validation set, both models showed similar discrimination (standard c-index 0.68 versus 0.67). The simulation study confirmed these results. For datasets with a high ICC (≥15%), model calibration was only adequate in external subjects, if the used performance measure assumed the same data structure as the model development method: standard calibration measures showed good calibration for the standard developed model, calibration measures adapting the clustered data structure showed good calibration for the prediction model with random intercept.

Conclusion: The models with random intercept discriminate better than the standard model only if the cluster effect is used for predictions. The prediction model with random intercept had good calibration within clusters.
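The discrimination result can be illustrated with a minimal simulation sketch. It is not the authors' simulation code: the function names (`sigma2_from_icc`, `c_index`, `simulate`), the number and size of clusters, and the coefficients are all assumptions chosen for illustration. It also assumes the common latent-variable definition of the ICC for a logistic random intercept model, ICC = σ² / (σ² + π²/3), and it scores patients with the true simulated linear predictors rather than with fitted models, so it shows only the mechanism.

```python
import math
import random


def sigma2_from_icc(icc):
    """Random-intercept variance on the logit scale for a target ICC.

    Assumes the latent-variable definition ICC = s2 / (s2 + pi^2 / 3).
    """
    return icc * (math.pi ** 2 / 3) / (1 - icc)


def c_index(y, score):
    """Concordance index (AUC) via the Mann-Whitney rank-sum formula."""
    order = sorted(range(len(score)), key=lambda i: score[i])
    ranks = [0.0] * len(score)
    i = 0
    while i < len(order):
        j = i
        # Extend j over a block of tied scores.
        while j + 1 < len(order) and score[order[j + 1]] == score[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # average 1-based rank of the tied block
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    n1 = sum(y)
    n0 = len(y) - n1
    rank_sum = sum(r for r, yi in zip(ranks, y) if yi == 1)
    return (rank_sum - n1 * (n1 + 1) / 2) / (n1 * n0)


def simulate(n_clusters=50, n_per_cluster=100, icc=0.30,
             beta0=-1.0, beta1=1.0, seed=1):
    """Simulate binary outcomes clustered by a normal random intercept.

    Returns the outcomes, the linear predictor without the cluster effect,
    and the linear predictor including the cluster effect.
    All default values are illustrative assumptions.
    """
    rng = random.Random(seed)
    sd_u = math.sqrt(sigma2_from_icc(icc))
    y, lp_fixed, lp_cluster = [], [], []
    for _ in range(n_clusters):
        u = rng.gauss(0.0, sd_u)  # cluster-specific intercept
        for _ in range(n_per_cluster):
            x = rng.gauss(0.0, 1.0)  # single patient-level predictor
            fixed = beta0 + beta1 * x
            p = 1.0 / (1.0 + math.exp(-(fixed + u)))
            y.append(1 if rng.random() < p else 0)
            lp_fixed.append(fixed)
            lp_cluster.append(fixed + u)
    return y, lp_fixed, lp_cluster


if __name__ == "__main__":
    for icc in (0.05, 0.15, 0.30):
        y, lp_fixed, lp_cluster = simulate(icc=icc)
        print(f"ICC={icc:.2f}  "
              f"c-index without cluster effect={c_index(y, lp_fixed):.3f}  "
              f"with cluster effect={c_index(y, lp_cluster):.3f}")
```

For patients from a new cluster the intercept is unknown, so a prediction can only use the part of the linear predictor without the cluster effect. That is consistent with the similar discrimination of both models in the external validation set.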

Highlights

When study data are clustered, standard regression analysis is considered inappropriate and analytical techniques for clustered data need to be used.
The differences that we found in calibration parameters between the standard model and the random intercept logistic regression model became somewhat smaller when the cluster effect was correlated with one of the predictors (Pearson correlation coefficient between cluster and X1 = 0.4, see Additional file 1: Table S1, S2 and S4).
In our data, the predictor effects for postoperative nausea and vomiting (PONV) differed between the random intercept logistic regression model and the standard model (Table 1).
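A difference in predictor effects between the two models is expected on theoretical grounds. The sketch below uses notation that is assumed here rather than taken from the paper (patient i in cluster j, predictor vector x_ij, superscript M for the standard or marginal model, superscript C for the cluster-conditional model). The attenuation formula is a widely used approximation that holds when the random intercept is normally distributed and independent of the predictors; it is not a result reported in this paper.

```latex
% Standard logistic regression: no cluster term
\operatorname{logit} P(Y_{ij} = 1 \mid x_{ij}) = \alpha^{M} + x_{ij}^{\top}\beta^{M}

% Random intercept logistic regression: cluster-specific intercept u_j
\operatorname{logit} P(Y_{ij} = 1 \mid x_{ij}, u_j) = \alpha^{C} + u_j + x_{ij}^{\top}\beta^{C},
\qquad u_j \sim N(0, \sigma_u^{2})

% Approximate attenuation of the marginal effects relative to the conditional effects
\beta^{M} \approx \frac{\beta^{C}}{\sqrt{1 + c^{2}\sigma_u^{2}}},
\qquad c = \frac{16\sqrt{3}}{15\pi}

% Intra-class correlation on the latent (logit) scale
\mathrm{ICC} = \frac{\sigma_u^{2}}{\sigma_u^{2} + \pi^{2}/3}
```

As a rough worked example under these assumptions, an ICC of 30% corresponds to σ_u² ≈ 1.41, so 1 + c²σ_u² ≈ 1.49 and the marginal coefficients are about 18% smaller in absolute value than the conditional ones.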

Summary

Introduction

When study data are clustered, standard regression analysis is considered inappropriate and analytical techniques for clustered data need to be used. We compared random effect and standard logistic regression models for their ability to provide accurate predictions. Study data that are used for model development are frequently clustered within, for example, centers or treating physicians [2]. Regression techniques that take clustering into account [3,4,5,6] are frequently used in cluster randomized trials and in etiologic research with subjects clustered within, for example, neighborhoods or countries. Such regression models have rarely been used in research aimed at developing prediction models [2].

Journal: BMC Medical Research Methodology
Publication Date: Feb 15, 2013
Citations: 86
License type: CC BY 2.0
