Evaluation of Four Multiple Imputation Methods for Handling Missing Binary Outcome Data in the Presence of an Interaction between a Dummy and a Continuous Variable

Sara Javadi,Mohammad Reza Baneshi,Mohammad Mehdi Saber,Behshid Garrusi,Abbas Bahrampour

doi:10.1155/2021/6668822

Abstract

Multiple imputation by chained equations (MICE) is the most common method for imputing missing data. In the MICE algorithm, imputation can be performed using a variety of parametric and nonparametric methods. The default setting in the implementation of MICE is for imputation models to include variables as linear terms only with no interactions, but omission of interaction terms may lead to biased results. It is investigated, using simulated and real datasets, whether recursive partitioning creates appropriate variability between imputations and unbiased parameter estimates with appropriate confidence intervals. We compared four multiple imputation (MI) methods on a real and a simulated dataset. MI methods included using predictive mean matching with an interaction term in the imputation model in MICE (MICE-interaction), classification and regression tree (CART) for specifying the imputation model in MICE (MICE-CART), the implementation of random forest (RF) in MICE (MICE-RF), and MICE-Stratified method. We first selected secondary data and devised an experimental design that consisted of 40 scenarios (2 × 5 × 4), which differed by the rate of simulated missing data (10%, 20%, 30%, 40%, and 50%), the missing mechanism (MAR and MCAR), and imputation method (MICE-Interaction, MICE-CART, MICE-RF, and MICE-Stratified). First, we randomly drew 700 observations with replacement 300 times, and then the missing data were created. The evaluation was based on raw bias (RB) as well as five other measurements that were averaged over the repetitions. Next, in a simulation study, we generated data 1000 times with a sample size of 700. Then, we created missing data for each dataset once. For all scenarios, the same criteria were used as for real data to evaluate the performance of methods in the simulation study. It is concluded that, when there is an interaction effect between a dummy and a continuous predictor, substantial gains are possible by using recursive partitioning for imputation compared to parametric methods, and also, the MICE-Interaction method is always more efficient and convenient to preserve interaction effects than the other methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Probability and Statistics	Publication Date: May 17, 2021
Citations: 9	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Evaluation of Four Multiple Imputation Methods for Handling Missing Binary Outcome Data in the Presence of an Interaction between a Dummy and a Continuous Variable

Abstract

Talk to us

Similar Papers

More From: Journal of Probability and Statistics

Lead the way for us

Similar Papers

Accuracy of Five Multiple Imputation Methods in Estimating Prevalence of Type 2 Diabetes based on STEPS Surveys
Hamid Heidarian Miri ... Ehsan Baradaran Sirjani
Journal of Epidemiology and Global Health | VOL. 10
Hamid Heidarian Miri, et. al.Hamid Heidarian Miri ... Ehsan Baradaran Sirjani
08 Jan 2020
Journal of Epidemiology and Global Health | VOL. 10

A Comparative Study of Imputation Methods for Multivariate Ordinal Data
Chayut Wongkamthong ... Olanrewaju Akande
Journal of Survey Statistics and Methodology | VOL. 11
Chayut Wongkamthong, et. al.Chayut Wongkamthong ... Olanrewaju Akande
09 Oct 2021
Journal of Survey Statistics and Methodology | VOL. 11

The application of nonparametric data augmentation and imputation using classification and regression trees within a large-scale panel study

-

01 Jan 2017
01 Jan 2017

A comparison of imputation techniques for handling missing predictor values in a risk model with a binary outcome
Gareth Ambler ... Rumana Z Omar
Statistical Methods in Medical Research | VOL. 16
Gareth Ambler, et. al.Gareth Ambler ... Rumana Z Omar
01 Jun 2007
Statistical Methods in Medical Research | VOL. 16

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Evaluation of Four Multiple Imputation Methods for Handling Missing Binary Outcome Data in the Presence of an Interaction between a Dummy and a Continuous Variable

Abstract

Talk to us

Similar Papers

More From: Journal of Probability and Statistics