Abstract

This article reports empirical explorations of how well the predictive mean matching method for imputing missing data works for an often problematic variable—income—when income is used as an explanatory variable in a substantive regression model. It is found that the performance of the predictive mean method varies considerably with the predictive power of the imputation regression model and the percentage of cases with missing data on income. In comparisons of single-value with multiple-imputation methods, it also is found that the amount of bias and the loss of precision associated with single-value methods is considerably less than that associated with a weak imputation model. Situations in which using imputed data can lead to seriously biased estimates of regression coefficients (and related statistics) and situations in which the bias is so minimal as to be nonproblematic are identified.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.