Review: A gentle introduction to imputation of missing values

A Rogier T Donders,Geert J.M.G Van Der Heijden,Theo Stijnen,Karel G.M Moons

doi:10.1016/j.jclinepi.2006.01.014

A Rogier T Donders, Geert J.M.G Van Der Heijden + Show 2 more

https://doi.org/10.1016/j.jclinepi.2006.01.014

Copy DOI

Abstract

In most situations, simple techniques for handling missing data (such as complete case analysis, overall mean imputation, and the missing-indicator method) produce biased results, whereas imputation techniques yield valid results without complicating the analysis once the imputations are carried out. Imputation techniques are based on the idea that any subject in a study sample can be replaced by a new randomly chosen subject from the same source population. Imputation of missing data on a variable is replacing that missing by a value that is drawn from an estimate of the distribution of this variable. In single imputation, only one estimate is used. In multiple imputation, various estimates are used, reflecting the uncertainty in the estimation of this distribution. Under the general conditions of so-called missing at random and missing completely at random, both single and multiple imputations result in unbiased estimates of study associations. But single imputation results in too small estimated standard errors, whereas multiple imputation results in correctly estimated standard errors and confidence intervals. In this article we explain why all this is the case, and use a simple simulation study to demonstrate our explanations. We also explain and illustrate why two frequently used methods to handle missing data, i.e., overall mean imputation and the missing-indicator method, almost always result in biased estimates.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Review: A gentle introduction to imputation of missing values

Abstract

Talk to us

Similar Papers

More From: Journal of Clinical Epidemiology

Lead the way for us

Journal: Journal of Clinical Epidemiology	Publication Date: Jul 11, 2006
Citations: 2138

Similar Papers

What is missing from my missing data plan?
Sharon D Yeatts ... Renée H Martin
Stroke | VOL. 46
Sharon D Yeatts, et. al.Sharon D Yeatts ... Renée H Martin
07 May 2015
Stroke | VOL. 46

Comparison of techniques for handling missing covariate data within prognostic modelling studies: a simulation study
Andrea Marshall ... Patrick Royston
BMC Medical Research Methodology | VOL. 10
Andrea Marshall, et. al.Andrea Marshall ... Patrick Royston
19 Jan 2010
BMC Medical Research Methodology | VOL. 10

Imputation of missing values is superior to complete case analysis and the missing-indicator method in multivariable diagnostic research: A clinical example
Geert J.M.G Van Der Heijden ... Karel G.M Moons
Journal of Clinical Epidemiology | VOL. 59
Geert J.M.G Van Der Heijden, et. al.Geert J.M.G Van Der Heijden ... Karel G.M Moons
11 Jul 2006
Journal of Clinical Epidemiology | VOL. 59

Dealing With Missing Outcome Data in Randomized Trials and Observational Studies
Rolf H H Groenwold ... Frank E Harrell
American Journal of Epidemiology | VOL. 175
Rolf H H Groenwold, et. al.Rolf H H Groenwold ... Frank E Harrell
23 Dec 2011
American Journal of Epidemiology | VOL. 175

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Review: A gentle introduction to imputation of missing values

Abstract

Talk to us

Similar Papers

More From: Journal of Clinical Epidemiology