Efficient and Doubly Robust Imputation for Covariate-Dependent Missing Responses

Jing Qin,Jun Shao,Biao Zhang

doi:10.1198/016214508000000238

Abstract

In this article we study a well-known response missing-data problem. Missing data is an ubiquitous problem in medical and social science studies. Imputation is one of the most popular methods for dealing with missing data. The most commonly used imputation that makes use of covariates is regression imputation, in which the regression model can be parametric, semiparametric, or nonparametric. Parametric regression imputation is efficient but is not robust against misspecification of the regression model. Although nonparametric regression imputation (such as nearest-neighbor imputation and kernel regression imputation) is model-free, it is not efficient, especially if the dimension of covariate vector is high (the well-known problem of curse of dimensionality). Semiparametric regression imputation (such as partially linear regression imputation) can reduce the dimension of the covariate in nonparametric regression fitting but is not robust against misspecification of the linear component in the regression. Assuming that the missing mechanism is covariate-dependent and that the propensity function can be specified correctly, we propose a regression imputation method that has good efficiency and is robust against regression model misspecification. Furthermore, our method is valid as long as either the regression model or the propensity model is correct, a property known as the double-robustness property. We show that asymptotically the sample mean based on our imputation achieves the semiparametric efficient lower bound if both regression and propensity models are specified correctly. Our simulation results demonstrate that the proposed method outperforms many existing methods for handling missing data, especially when the regression model is misspecified. As an illustration, an economic observational data set is analyzed.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Efficient and Doubly Robust Imputation for Covariate-Dependent Missing Responses

Abstract

Talk to us

Similar Papers

More From: Journal of the American Statistical Association

Lead the way for us

Journal: Journal of the American Statistical Association	Publication Date: Jun 1, 2008
Citations: 78

Similar Papers

Bias in regression coefficient estimates upon different treatments of systematically missing data
L A Othuon
East African Journal of Statistics | VOL. 1
L A OthuonL A Othuon
17 Jul 2007
East African Journal of Statistics | VOL. 1

Bias in regression coefficient estimates upon different treatments of systematically missing data
Lo Othuon
East African Journal of Statistics | VOL. 1
Lo OthuonLo Othuon
17 Jul 2007
East African Journal of Statistics | VOL. 1

Simulation study on missing data imputation methods for longitudinal data in cohort studies
Y M Li ... F Y Chen
Zhonghua liu xing bing xue za zhi = Zhonghua liuxingbingxue zazhi | VOL. 42
Y M Li, et. al.Y M Li ... F Y Chen
10 Oct 2021
Zhonghua liu xing bing xue za zhi = Zhonghua liuxingbingxue zazhi | VOL. 42

Imputation and missing indicators for handling missing data in the development and deployment of clinical prediction models: A simulation study.
Rose Sisk ... Matthew Sperrin
Statistical Methods in Medical Research | VOL. 32
Rose Sisk, et. al.Rose Sisk ... Matthew Sperrin
27 Apr 2023
Statistical Methods in Medical Research | VOL. 32

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Efficient and Doubly Robust Imputation for Covariate-Dependent Missing Responses

Abstract

Talk to us

Similar Papers

More From: Journal of the American Statistical Association