A Comparative Study of Imputation Methods for Multivariate Ordinal Data

Chayut Wongkamthong,Olanrewaju Akande

doi:10.1093/jssam/smab028

Abstract

Abstract Missing data remains a very common problem in large datasets, including survey and census data containing many ordinal responses, such as political polls and opinion surveys. Multiple imputation (MI) is usually the go-to approach for analyzing such incomplete datasets, and there are indeed several implementations of MI, including methods using generalized linear models, tree-based models, and Bayesian non-parametric models. However, there is limited research on the statistical performance of these methods for multivariate ordinal data. In this article, we perform an empirical evaluation of several MI methods, including MI by chained equations (MICE) using multinomial logistic regression models, MICE using proportional odds logistic regression models, MICE using classification and regression trees, MICE using random forest, MI using Dirichlet process (DP) mixtures of products of multinomial distributions, and MI using DP mixtures of multivariate normal distributions. We evaluate the methods using simulation studies based on ordinal variables selected from the 2018 American Community Survey. Under our simulation settings, the results suggest that MI using proportional odds logistic regression models, classification and regression trees, and DP mixtures of multinomial distributions generally outperform the other methods. In certain settings, MI using multinomial logistic regression models is able to achieve comparable performance, depending on the missing data mechanism and amount of missing data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Comparative Study of Imputation Methods for Multivariate Ordinal Data

Abstract

Talk to us

Similar Papers

More From: Journal of Survey Statistics and Methodology

Lead the way for us

Journal: Journal of Survey Statistics and Methodology	Publication Date: Oct 9, 2021
Citations: 7

Similar Papers

Evaluation of Four Multiple Imputation Methods for Handling Missing Binary Outcome Data in the Presence of an Interaction between a Dummy and a Continuous Variable
Sara Javadi ... Abbas Bahrampour
Journal of Probability and Statistics | VOL. 2021
Sara Javadi, et. al.Sara Javadi ... Abbas Bahrampour
17 May 2021
Journal of Probability and Statistics | VOL. 2021

Accuracy of Five Multiple Imputation Methods in Estimating Prevalence of Type 2 Diabetes based on STEPS Surveys
Hamid Heidarian Miri ... Ehsan Baradaran Sirjani
Journal of Epidemiology and Global Health | VOL. 10
Hamid Heidarian Miri, et. al.Hamid Heidarian Miri ... Ehsan Baradaran Sirjani
08 Jan 2020
Journal of Epidemiology and Global Health | VOL. 10

The use of multiple imputation for the accurate measurements of individual feed intake by electronic feeders.
S Jiao ... F Tiezzi
Journal of Animal Science | VOL. 94
S Jiao, et. al.S Jiao ... F Tiezzi
01 Feb 2016
Journal of Animal Science | VOL. 94

Foetal ultrasound measurement imputations based on growth curves versus multiple imputation chained equation (MICE)
Kelly K Ferguson ... Bhramar Mukherjee
Paediatric and Perinatal Epidemiology | VOL. 32
Kelly K Ferguson, et. al.Kelly K Ferguson ... Bhramar Mukherjee
17 Jul 2018
Paediatric and Perinatal Epidemiology | VOL. 32

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Comparative Study of Imputation Methods for Multivariate Ordinal Data

Abstract

Talk to us

Similar Papers

More From: Journal of Survey Statistics and Methodology