Multiply imputing missing values in data sets with mixed measurement scales using a sequence of generalised linear models

Min Cherng Lee,Robin Mitra

doi:10.1016/j.csda.2015.08.004

Abstract

Multiple imputation is a commonly used approach to deal with missing values. In this approach, an imputer repeatedly imputes the missing values by taking draws from the posterior predictive distribution for the missing values conditional on the observed values, and releases these completed data sets to analysts. With each completed data set the analyst performs the analysis of interest, treating the data as if it were fully observed. These analyses are then combined with standard combining rules, allowing the analyst to make appropriate inferences which take into account the uncertainty present due to the missing data. In order to preserve the statistical properties present in the data, the imputer must use a plausible distribution to generate the imputed values. In data sets containing variables with different measurement scales, e.g. some categorical and some continuous variables, this is a challenging problem. A method is proposed to multiply impute missing values in such data sets by modelling the joint distribution of the variables in the data through a sequence of generalised linear models, and data augmentation methods are used to draw imputations from a proper posterior distribution using Markov Chain Monte Carlo (MCMC). The performance of the proposed method is illustrated using simulation studies and on a data set taken from a breast feeding study.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multiply imputing missing values in data sets with mixed measurement scales using a sequence of generalised linear models

Abstract

Talk to us

Similar Papers

More From: Computational Statistics & Data Analysis

Lead the way for us

Journal: Computational Statistics & Data Analysis	Publication Date: Sep 9, 2015
Citations: 25

Similar Papers

Missing Value Imputation with Unsupervised Backpropagation
Michael S Gashler ... Tony Martinez
Computational Intelligence | VOL. 32
Michael S Gashler, et. al.Michael S Gashler ... Tony Martinez
01 Jul 2014
Computational Intelligence | VOL. 32

The application of nonparametric data augmentation and imputation using classification and regression trees within a large-scale panel study

-

01 Jan 2017
01 Jan 2017

A Hybrid Approach for Missing Data Imputation in Gene Expression Dataset Using Extra Tree Regressor and a Genetic Algorithm
Amarjeet Yadav ... Aditya Dubey
-
Amarjeet Yadav, et. al.Amarjeet Yadav ... Aditya Dubey
01 Jan 2023
01 Jan 2023

Handling Missing Values in Chronic Kidney Disease Datasets Using KNN, K-Means and K-Medoids Algorithms
Tahira Mahboob ... Muqadas Kalsoom
-
Tahira Mahboob, et. al.Tahira Mahboob ... Muqadas Kalsoom
01 Dec 2018
01 Dec 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multiply imputing missing values in data sets with mixed measurement scales using a sequence of generalised linear models

Abstract

Talk to us

Similar Papers

More From: Computational Statistics &amp; Data Analysis

More From: Computational Statistics & Data Analysis