Imputation strategies for missing binary outcomes in cluster randomized trials

Jinhui Ma,Lisa Dolovich,Lehana Thabane,Noori Akhtar-Danesh

doi:10.1186/1471-2288-11-18

Jinhui Ma, Lisa Dolovich + Show 2 more

Open Access

PDF Available

https://doi.org/10.1186/1471-2288-11-18

Copy DOI

Export

Save

Cite

Abstract
Highlights/Summary
Full-Text PDF
Similar Papers

Abstract

Listen

BackgroundAttrition, which leads to missing data, is a common problem in cluster randomized trials (CRTs), where groups of patients rather than individuals are randomized. Standard multiple imputation (MI) strategies may not be appropriate to impute missing data from CRTs since they assume independent data. In this paper, under the assumption of missing completely at random and covariate dependent missing, we compared six MI strategies which account for the intra-cluster correlation for missing binary outcomes in CRTs with the standard imputation strategies and complete case analysis approach using a simulation study.MethodWe considered three within-cluster and three across-cluster MI strategies for missing binary outcomes in CRTs. The three within-cluster MI strategies are logistic regression method, propensity score method, and Markov chain Monte Carlo (MCMC) method, which apply standard MI strategies within each cluster. The three across-cluster MI strategies are propensity score method, random-effects (RE) logistic regression approach, and logistic regression with cluster as a fixed effect. Based on the community hypertension assessment trial (CHAT) which has complete data, we designed a simulation study to investigate the performance of above MI strategies.ResultsThe estimated treatment effect and its 95% confidence interval (CI) from generalized estimating equations (GEE) model based on the CHAT complete dataset are 1.14 (0.76 1.70). When 30% of binary outcome are missing completely at random, a simulation study shows that the estimated treatment effects and the corresponding 95% CIs from GEE model are 1.15 (0.76 1.75) if complete case analysis is used, 1.12 (0.72 1.73) if within-cluster MCMC method is used, 1.21 (0.80 1.81) if across-cluster RE logistic regression is used, and 1.16 (0.82 1.64) if standard logistic regression which does not account for clustering is used.ConclusionWhen the percentage of missing data is low or intra-cluster correlation coefficient is small, different approaches for handling missing binary outcome data generate quite similar results. When the percentage of missing data is large, standard MI strategies, which do not take into account the intra-cluster correlation, underestimate the variance of the treatment effect. Within-cluster and across-cluster MI strategies (except for random-effects logistic regression MI strategy), which take the intra-cluster correlation into account, seem to be more appropriate to handle the missing outcome from CRTs. Under the same imputation strategy and percentage of missingness, the estimates of the treatment effect from GEE and RE logistic regression models are similar.

Highlights

Cluster randomized trials (CRTs), where groups of participants rather than individuals are randomized, are increasingly being used in health promotion and health services research [1]
The estimated treatment effect and its 95% confidence interval (CI) from generalized estimating equations (GEE) model based on the community hypertension assessment trial (CHAT) complete dataset are 1.14 (0.76 1.70)
When 30% of binary outcome are missing completely at random, a simulation study shows that the estimated treatment effects and the corresponding 95% CIs from GEE model are 1.15 (0.76 1.75) if complete case analysis is used, 1.12 (0.72 1.73) if within-cluster Markov chain Monte Carlo (MCMC) method is used, 1.21 (0.80 1.81) if across-cluster RE logistic regression is used, and 1.16 (0.82 1.64) if standard logistic regression which does not account for clustering is used

Summary

Introduction

Cluster randomized trials (CRTs), where groups of participants rather than individuals are randomized, are increasingly being used in health promotion and health services research [1]. The default approach in dealing with this problem is to use complete case analysis ( called listwise deletion), i.e. exclude the participants with missing data from the analysis Though this approach is easy to use and is the default option in most statistical packages, it may substantially weaken the statistical power of the trial and may lead to biased results depending on the mechanism of the missing data. Attrition, which leads to missing data, is a common problem in cluster randomized trials (CRTs), where groups of patients rather than individuals are randomized. Under the assumption of missing completely at random and covariate dependent missing, we compared six MI strategies which account for the intra-cluster correlation for missing binary outcomes in CRTs with the standard imputation strategies and complete case analysis approach using a simulation study

Objectives

Methods

Results

Discussion

Conclusion

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Medical Research Methodology	Publication Date: Feb 16, 2011
Citations: 38	License type: cc-by

R Discovery Prime

Imputation strategies for missing binary outcomes in cluster randomized trials

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: BMC Medical Research Methodology

Lead the way for us

Similar Papers

Comparison of population-averaged and cluster-specific models for the analysis of cluster randomized trials with missing binary outcomes: a simulation study
Jinhui Ma ... Parminder Raina
BMC Medical Research Methodology | VOL. 13
Jinhui Ma, et. al.Jinhui Ma ... Parminder Raina
23 Jan 2013
BMC Medical Research Methodology | VOL. 13

Comparing the performance of different multiple imputation strategies for missing binary outcomes in cluster randomized trials: a simulation study
Lehana Thabane ... Raina
Open Access Medical Statistics | VOL. 2
Lehana Thabane, et. al.Lehana Thabane ... Raina
01 Dec 2012
Open Access Medical Statistics | VOL. 2

Abstract A09: Uncovering nativity disparities in cancer patterns: A multiple imputation strategy to handle missing nativity data in the SEER database.
Jane R Montealegre ... Renke Zhou
Cancer Epidemiology, Biomarkers & Prevention | VOL. 21
Jane R Montealegre, et. al.Jane R Montealegre ... Renke Zhou
01 Oct 2012
Cancer Epidemiology, Biomarkers & Prevention | VOL. 21

Multiple imputation strategies for a bounded outcome variable in a competing risks analysis.
Elinor Curnow ... Margaret T May
Statistics in medicine | VOL. 40
Elinor Curnow, et. al.Elinor Curnow ... Margaret T May
19 Jan 2021
Statistics in medicine | VOL. 40

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Imputation strategies for missing binary outcomes in cluster randomized trials

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: BMC Medical Research Methodology