Comparison of five imputation methods in handling missing data in a continuous frequency table

M B Mohammed,M B Adam,H S Zulkafli,I A Baba,N Ali

doi:10.1063/5.0053286

Abstract

Missing data are sometimes inevitable that could affect the overall results of research. Sometimes missing data that occurs in data render the continuous frequency table incomplete, and hence the need to estimate them to arrive at valid results. Thus to estimate the missing data, it is appropriate to use one of the scientific imputation methods reported in the literature. This study aims to compare five different missing data imputation methods, mean imputation, median imputation, k nearest neighbors, sample imputation, and multiple imputations by using chained equations (MICE). The five imputation methods are compared using four real datasets. Nine different percentages of missingness are introduced completely at random into the datasets. The statistical metric, root-mean-squared error (RMSE), is used to assess the performance of the methods. Results show that the multiple imputations by using chained equations (MICE) outperformed the other imputation methods. The mean and k nearest neighbor (KNN) performed better relative to sample and median imputation methods. The five imputation methods’ performance is independent of the dataset and the percentage of missingness.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Comparison of five imputation methods in handling missing data in a continuous frequency table

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Simulation study on missing data imputation methods for longitudinal data in cohort studies
Y M Li ... F Y Chen
Zhonghua liu xing bing xue za zhi = Zhonghua liuxingbingxue zazhi | VOL. 42
Y M Li, et. al.Y M Li ... F Y Chen
10 Oct 2021
Zhonghua liu xing bing xue za zhi = Zhonghua liuxingbingxue zazhi | VOL. 42

Advanced methods for missing values imputation based on similarity learning.
Khaled M Fouad ... Mona M Arafa
PeerJ. Computer science | VOL. 7
Khaled M Fouad, et. al.Khaled M Fouad ... Mona M Arafa
21 Jul 2021
PeerJ. Computer science | VOL. 7

Evaluating Methods for Imputing Missing Data from Longitudinal Monitoring of Athlete Workload.
Lauren C Benson ... Carlyn Stilling
Journal of Sports Science and Medicine | VOL. 20
Lauren C Benson, et. al.Lauren C Benson ... Carlyn Stilling
05 Mar 2021
Journal of Sports Science and Medicine | VOL. 20

Identifying reprioritization response shift in a stroke caregiver population: a comparison of missing data methods.
Tolulope T Sajobi ... Gurbakhshash Singh
Quality of Life Research | VOL. 24
Tolulope T Sajobi, et. al.Tolulope T Sajobi ... Gurbakhshash Singh
26 Oct 2014
Quality of Life Research | VOL. 24

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Comparison of five imputation methods in handling missing data in a continuous frequency table

Abstract

Talk to us

Similar Papers