Multiple Imputation via Generative Adversarial Network for High-dimensional Blockwise Missing Value Problems.

Zongyu Dai,Zhiqi Bu,Qi Long

doi:10.1109/icmla52953.2021.00131

Abstract

Missing data are present in most real world problems and need careful handling to preserve the prediction accuracy and statistical consistency in the downstream analysis. As the gold standard of handling missing data, multiple imputation (MI) methods are proposed to account for the imputation uncertainty and provide proper statistical inference. In this work, we propose Multiple Imputation via Generative Adversarial Network (MI-GAN), a deep learning-based (in specific, a GAN-based) multiple imputation method, that can work under missing at random (MAR) mechanism with theoretical support. MI-GAN leverages recent progress in conditional generative adversarial neural works and shows strong performance matching existing state-of-the-art imputation methods on high-dimensional datasets, in terms of imputation error. In particular, MI-GAN significantly outperforms other imputation methods in the sense of statistical inference and computational speed.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multiple Imputation via Generative Adversarial Network for High-dimensional Blockwise Missing Value Problems.

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... International Conference on Machine Learning and Applications. International Conference on Machine Learning and Applications

Lead the way for us

Journal: Proceedings of the ... International Conference on Machine Learning and Applications. International Conference on Machine Learning and Applications	Publication Date: Dec 1, 2021
Citations: 11

Similar Papers

How to deal with missing longitudinal data in cost of illness analysis in Alzheimer's disease-suggestions from the GERAS observational study.
Mark Belger ... Catherine Reed
BMC Medical Research Methodology | VOL. 16
Mark Belger, et. al.Mark Belger ... Catherine Reed
18 Jul 2016
BMC Medical Research Methodology | VOL. 16

Comparative Study of Four Methods in Missing Value Imputations under Missing Completely at Random Mechanism
Michikazu Nakai ... Kunihiro Nishimura
Open journal of statistics | VOL. 04
Michikazu Nakai, et. al.Michikazu Nakai ... Kunihiro Nishimura
01 Jan 2014
Open journal of statistics | VOL. 04

A Comparison of Multiple Imputation and Optimal Estimation for Missing and Uncertain Urban Air Toxics Data
H Le ... S Batterman
Epidemiology | VOL. 17
H Le, et. al.H Le ... S Batterman
01 Nov 2006
Epidemiology | VOL. 17

Evaluation of Four Multiple Imputation Methods for Handling Missing Binary Outcome Data in the Presence of an Interaction between a Dummy and a Continuous Variable
Sara Javadi ... Marek T Malinowski
Journal of Probability and Statistics | VOL. 2021
Sara Javadi, et. al.Sara Javadi ... Marek T Malinowski
17 May 2021
Journal of Probability and Statistics | VOL. 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multiple Imputation via Generative Adversarial Network for High-dimensional Blockwise Missing Value Problems.

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... International Conference on Machine Learning and Applications. International Conference on Machine Learning and Applications