Combining attention with spectrum to handle missing values on time series data without imputation

Yen-Pin Chen,Chien-Hua Huang,Yuan-Hsun Lo,Yi-Ying Chen,Feipei Lai

doi:10.1016/j.ins.2022.07.124

Abstract

In the development of predictive models, the problem of missing data is a critical issue that traditionally requires a two-step analysis. Data scientists analyze the patterns of missing values, select variables, impute missing values on the basis of domain knowledge, and then train a model. Models typically have their input sizes hardcoded, and have limitations in handling data with high missing rates or changes in available variables. We propose an attention-based neural network combined with a novel real number representation, which requires little work on manually selecting variables, and in which missing data can be overlooked, making imputation unnecessary. In this proposed model, data analysis can be one step, omitting the first step of imputing missing values. The study included data on 32,709 intensive care unit (ICU) admissions and 60 healthcare variables from the Medical Information Mart for Intensive Care (MIMIC)-IV. The proposed algorithm yielded an area under the receiver operating characteristic curve (AUC) of 0.842 (95% CIs: 0.828–0.856) when predicting prolonged length of stay in the ICU, outperforming current approaches using imputation methods. The proposed algorithm can be applied to a range of problems in data science, as it addresses the issue of incomplete data with automatic variable selection.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Information Sciences	Publication Date: Jul 28, 2022
Citations: 9	License type: cc-by-nc-nd

R Discovery Prime

R Discovery Prime

Combining attention with spectrum to handle missing values on time series data without imputation

Abstract

Talk to us

Similar Papers

More From: Information Sciences

Lead the way for us

Similar Papers

Development and validation of a machine-learning model for prediction of hypoxemia after extubation in intensive care units
Ming Xia ... Hong Jiang
Annals of Translational Medicine | VOL. 10
Ming Xia, et. al.Ming Xia ... Hong Jiang
01 May 2022
Annals of Translational Medicine | VOL. 10

Description of Clinical Characteristics of VAP Patients in MIMIC Database
Qingqing Liu ... Fanfan Zhao
Frontiers in Pharmacology | VOL. 10
Qingqing Liu, et. al.Qingqing Liu ... Fanfan Zhao
04 Feb 2019
Frontiers in Pharmacology | VOL. 10

Transformation and Evaluation of the MIMIC Database in the OMOP Common Data Model: Development and Usability Study.
Nicolas Paris ... Adrien Parrot
JMIR Medical Informatics | VOL. 9
Nicolas Paris, et. al.Nicolas Paris ... Adrien Parrot
14 Dec 2021
JMIR Medical Informatics | VOL. 9

Aspirin Therapy and 28-Day Mortality in ICU Patients: A Retrospective Observational Study From Two Large Databases
Luhao Wang ... Xiangdong Guan
Clinical Therapeutics | VOL. 45
Luhao Wang, et. al.Luhao Wang ... Xiangdong Guan
25 Mar 2023
Clinical Therapeutics | VOL. 45

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Combining attention with spectrum to handle missing values on time series data without imputation

Abstract

Talk to us

Similar Papers

More From: Information Sciences