Missing Value Imputation Methods for Electronic Health Records

Konstantinos Psychogyios, Loukas Ilias, Christos Ntanos, Dimitris Askounis

doi:10.1109/access.2023.3251919

Open Access

https://doi.org/10.1109/access.2023.3251919

Journal: IEEE Access
Publication Date: Jan 1, 2023
Citations: 22
License type: CC BY-NC-ND 4.0

Affiliation: National Technical University of Athens

Abstract

Electronic health records (EHRs) are patient-level information, e.g., laboratory tests and questionnaires, stored in electronic format. Compared to physical records, EHRs allow patients to access their data easily and help staff with procedural management tasks such as information sharing across different organizations. Moreover, this type of data is commonly used by researchers for prediction and classification with statistical and machine learning methods. However, missing values are observed very frequently in such measurements. Even though this missingness is often significant, it is usually treated poorly, with either case deletion or simple methods, resulting in suboptimal and/or inaccurate predictive results. This happens because simple methods, e.g., k-nearest neighbors (kNN) and mean/mode imputation, fail in most cases to capture the complex relationships that define these medical datasets. To address these limitations, in this paper we test and improve state-of-the-art missing data imputation models and practices. We propose a new missing value imputation method based on denoising autoencoders (DAE) with kNN for the pre-imputation task. We optimize the training methodology by re-applying kNN to the missing data every N epochs, using a different value of k each time, to yield more accurate results. We also revise a state-of-the-art missing data imputation approach based on a generative adversarial network (GAN). Using this as a baseline, we introduce improvements to both the architecture and the training procedure. These models are compared with the ones usually employed in clinical research studies on both imputation and post-imputation prediction. Results show that our proposed deep learning approaches outperform the standard baselines, yielding better imputation and predictive results.
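
To make the training idea in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It pre-imputes with a simple kNN, trains a small denoising autoencoder on the filled data, and refreshes the kNN fill every N epochs with a different k. The function names (knn_impute, dae_impute), the one-hidden-layer network, the zero-masking noise, the cycle of k values, and the choice to compute the loss only on observed entries are all assumptions made for illustration; the abstract does not specify them.

```python
import numpy as np

def knn_impute(X, mask, k):
    """Fill missing entries (mask == False) with the mean of up to k nearest donor rows.

    Assumption of this sketch: distance is the mean squared difference over
    features observed in both rows.
    """
    X_filled = X.copy()
    col_means = np.nanmean(X, axis=0)            # fallback when no donor exists
    for i in np.where(~mask.all(axis=1))[0]:     # rows with at least one gap
        shared = mask & mask[i]                  # features observed in both rows
        diff = np.where(shared, X - X[i], 0.0)
        counts = shared.sum(axis=1)
        dist = np.full(len(X), np.inf)
        ok = counts > 0
        dist[ok] = (diff[ok] ** 2).sum(axis=1) / counts[ok]
        dist[i] = np.inf                         # a row is not its own neighbor
        order = np.argsort(dist)
        for j in np.where(~mask[i])[0]:
            donors = [r for r in order if mask[r, j] and np.isfinite(dist[r])][:k]
            X_filled[i, j] = X[donors, j].mean() if donors else col_means[j]
    return X_filled

def dae_impute(X, n_hidden=8, epochs=200, N=50, ks=(3, 5, 7),
               lr=0.1, drop=0.2, seed=0):
    """Denoising autoencoder imputation with periodic kNN re-imputation.

    X uses NaN for missing values. All hyperparameters are illustrative.
    """
    rng = np.random.default_rng(seed)
    mask = ~np.isnan(X)
    n, d = X.shape
    W1 = rng.normal(0, 0.1, (d, n_hidden)); b1 = np.zeros(n_hidden)
    W2 = rng.normal(0, 0.1, (n_hidden, d)); b2 = np.zeros(d)
    for epoch in range(epochs):
        if epoch % N == 0:
            # Every N epochs, redo the kNN pre-imputation with the next k.
            k = ks[(epoch // N) % len(ks)]
            X_in = knn_impute(X, mask, k)
        keep = rng.random((n, d)) > drop         # corrupt the input by zeroing entries
        X_noisy = X_in * keep
        H = np.tanh(X_noisy @ W1 + b1)
        out = H @ W2 + b2
        # Gradient of the mean squared error restricted to observed entries.
        err = (out - X_in) * mask / mask.sum()
        dH = (err @ W2.T) * (1 - H ** 2)
        W2 -= lr * (H.T @ err); b2 -= lr * err.sum(axis=0)
        W1 -= lr * (X_noisy.T @ dH); b1 -= lr * dH.sum(axis=0)
    out = np.tanh(X_in @ W1 + b1) @ W2 + b2      # reconstruct from the clean input
    return np.where(mask, X, out)                # keep observed values untouched

# Usage on synthetic data (not EHR data): hide about 20% of the entries.
rng = np.random.default_rng(0)
X_true = rng.normal(size=(60, 5))
X_true[:, 1] = X_true[:, 0] + 0.1 * rng.normal(size=60)   # one correlated column
X_miss = X_true.copy()
X_miss[rng.random(X_miss.shape) < 0.2] = np.nan
X_imp = dae_impute(X_miss)
```

In this sketch only the missing positions are replaced by the autoencoder output, so observed measurements pass through unchanged. In practice one would standardize the features first and tune N, the k values, and the network size on held-out data.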

Similar Papers

Comparison of Machine Learning Approaches for Missing Data Imputation Among Non-Small Cell Lung Cancer Patients
D.X Yang ... S Aneja
International Journal of Radiation Oncology*Biology*Physics | VOL. 111
22 Oct 2021

Medical Student Use of Electronic and Paper Health Records During Inpatient Clinical Clerkships: Results of a National Longitudinal Study.
Lauren M Foster ... Maya M Hammoud
Academic medicine : journal of the Association of American Medical Colleges | VOL. 93
01 Nov 2018

DEEP LEARNING-BASED APPROACH FOR MISSING DATA IMPUTATION
Pinar Ci̇han
Eskişehir Teknik Üniversitesi Bilim ve Teknoloji Dergisi B - Teorik Bilimler | VOL. 8
31 Aug 2020

A Novel Method for Imputing Missing Values in Ship Static Data Based on Generative Adversarial Networks
Junbo Gao ... Yingqi Jiao
Journal of Marine Science and Engineering | VOL. 11
10 Apr 2023
