Abstract
AbstractMissing data are a common problem in statistics. Imputation, or filling in the missing values, is an intuitive and flexible way to address the resulting incomplete data sets. We focus on multiple imputation, which, when implemented correctly, can be a statistically valid strategy for handling missing data. The analysis of a multiply‐imputed data set is now relatively standard using readily available statistical software. The creation of multiply‐imputed data sets is more challenging than their analysis but still straightforward relative to other valid methods of handling missing data, and we discuss available software for doing so.Ad hocmethods, including using singly‐imputed data sets, almost always lead to invalid inferences and should be eschewed, especially when valid interval estimation or hypothesis testing is the objective.WIREs Comput Stat2013, 5:20–29. doi: 10.1002/wics.1240This article is categorized under:Statistical and Graphical Methods of Data Analysis > Data Reduction, Smoothing, and FilteringStatistical and Graphical Methods of Data Analysis > Sampling
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.