Bootstrap inference for multiple imputation under uncongeniality and misspecification.

Jonathan W Bartlett,Rachael A Hughes

doi:10.1177/0962280220932189

Jonathan W Bartlett, Rachael A Hughes

Open Access

https://doi.org/10.1177/0962280220932189

Copy DOI

Abstract

Multiple imputation has become one of the most popular approaches for handling missing data in statistical analyses. Part of this success is due to Rubin’s simple combination rules. These give frequentist valid inferences when the imputation and analysis procedures are so-called congenial and the embedding model is correctly specified, but otherwise may not. Roughly speaking, congeniality corresponds to whether the imputation and analysis models make different assumptions about the data. In practice, imputation models and analysis procedures are often not congenial, such that tests may not have the correct size, and confidence interval coverage deviates from the advertised level. We examine a number of recent proposals which combine bootstrapping with multiple imputation and determine which are valid under uncongeniality and model misspecification. Imputation followed by bootstrapping generally does not result in valid variance estimates under uncongeniality or misspecification, whereas certain bootstrap followed by imputation methods do. We recommend a particular computationally efficient variant of bootstrapping followed by imputation.

Highlights

Multiple imputation (MI) has proven to be an extremely versatile and popular tool for handling missing data in statistical analyses
We investigate the properties of the different combinations of MI and bootstrap which have been recommended by these previous papers, giving particular emphasis to their validity under uncongeniality or model misspecification
We have reviewed a number of proposals for combining MI with bootstrapping, in particular with regards to their statistical validity when imputation and analysis procedures are uncongenial or misspecified

Summary

Introduction

Multiple imputation (MI) has proven to be an extremely versatile and popular tool for handling missing data in statistical analyses. Rubin’s variance estimator combines the average within-imputation variance with the betweenimputation variance in estimates This requires an estimator of the complete data variance, which for most estimators is available analytically. On the basis of theoretical and empirical investigation, they recommended three of the four variants for use They did not explicitly seek to investigate performance under uncongeniality or model misspecification . We investigate the properties of the different combinations of MI and bootstrap which have been recommended by these previous papers, giving particular emphasis to their validity under uncongeniality or model misspecification.

Rubin’s rules

Congeniality

Imputation followed by bootstrapping

Bootstrap followed by MI

Boot MI von Hippel

Regression models under uncongeniality or misspecification

Reference-based imputation in clinical trials

Discussion

Declaration of conflicting interests

Findings

Methods

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Statistical Methods in Medical Research	Publication Date: Jun 30, 2020
Citations: 56	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Bootstrap inference for multiple imputation under uncongeniality and misspecification.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Statistical Methods in Medical Research

Lead the way for us

Similar Papers

A Simplified Framework for Using Multiple Imputation in Social Work Research
R A Rose ... M W Fraser
Social Work Research | VOL. 32
R A Rose, et. al.R A Rose ... M W Fraser
01 Sep 2008
Social Work Research | VOL. 32

Multiple imputation of missing data under missing at random: compatible imputation models are not sufficient to avoid bias if they are mis-specified
Elinor Curnow ... Kate Tilling
Journal of Clinical Epidemiology | VOL. 160
Elinor Curnow, et. al.Elinor Curnow ... Kate Tilling
19 Jun 2023
Journal of Clinical Epidemiology | VOL. 160

Nonlinear multiple imputation for continuous covariate within semiparametric Cox model: application to HIV data in Senegal
Jules Brice Tchatchueng Mbougua ... Christian Laurent
Statistics in Medicine | VOL. 32
Jules Brice Tchatchueng Mbougua, et. al.Jules Brice Tchatchueng Mbougua ... Christian Laurent
28 May 2013
Statistics in Medicine | VOL. 32

Quantifying the impact of fixed effects modeling of clusters in multiple imputation for cluster randomized trials.
Rebecca R Andridge
Biometrical Journal | VOL. 53
Rebecca R AndridgeRebecca R Andridge
24 Jan 2011
Biometrical Journal | VOL. 53

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Bootstrap inference for multiple imputation under uncongeniality and misspecification.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Statistical Methods in Medical Research