Performance Comparison of Imputation Methods for Mixed Data Missing at Random with Small and Large Sample Data Set with Different Variability

Christina Nicole Holder Lewis,Kyei Baffour Afari

doi:10.9734/ajpas/2022/v20i2416

Abstract

One of the concerns in the field of statistics is the presence of missing data, which leads to bias in parameter estimation and inaccurate results. However, the multiple imputation procedure is a remedy for handling missing data. This study looked at the best multiple imputation methods used to handle mixed variable datasets with different sample sizes and variability along with different levels of missingness. The study employed the predictive mean matching, classification and regression trees, and the random forest imputation methods. For each dataset, the multiple regression parameter estimates for the complete datasets were compared to the multiple regression parameter estimates found with the imputed dataset. The results showed that the random forest imputation method was the best for mostly a sample of 500 irrespective of the variability. The classification and regression tree imputation methods worked best mostly on sample of 30 irrespective of the variability.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Performance Comparison of Imputation Methods for Mixed Data Missing at Random with Small and Large Sample Data Set with Different Variability

Abstract

Talk to us

Similar Papers

More From: Asian Journal of Probability and Statistics

Lead the way for us

Similar Papers

Comparison of Single and MICE Imputation Methods for Missing Values: A Simulation Study
Nurul Azifah Mohd Pauzi ... Sayang Mohd Deni
Pertanika Journal of Science and Technology | VOL. 29
Nurul Azifah Mohd Pauzi, et. al.Nurul Azifah Mohd Pauzi ... Sayang Mohd Deni
30 Apr 2021
Pertanika Journal of Science and Technology | VOL. 29

Comparing Several Missing Data Estimation Methods in Linear Regression;Real Data Example and A Simulation Study
Anwar Fitrianto ... Jap Ee Jia
CAUCHY: Jurnal Matematika Murni dan Aplikasi | VOL. 7
Anwar Fitrianto, et. al.Anwar Fitrianto ... Jap Ee Jia
24 May 2023
CAUCHY: Jurnal Matematika Murni dan Aplikasi | VOL. 7

A Comparison of Multiple Imputation and Optimal Estimation for Missing and Uncertain Urban Air Toxics Data
H Le ... M Depa
Epidemiology | VOL. 17
H Le, et. al.H Le ... M Depa
01 Nov 2006
Epidemiology | VOL. 17

Multi-metric comparison of machine learning imputation methods with application to breast cancer survival
Imad El Badisy ... Roch Giorgi
BMC Medical Research Methodology | VOL. 24
Imad El Badisy, et. al.Imad El Badisy ... Roch Giorgi
30 Aug 2024
BMC Medical Research Methodology | VOL. 24

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Performance Comparison of Imputation Methods for Mixed Data Missing at Random with Small and Large Sample Data Set with Different Variability

Abstract

Talk to us

Similar Papers

More From: Asian Journal of Probability and Statistics