Generating synthetic mixed-type longitudinal electronic health records for artificial intelligent applications

Jin Li, Tingting Zhu, Jingsong Li, Benjamin J Cairns

doi:10.1038/s41746-023-00834-7

Open Access

https://doi.org/10.1038/s41746-023-00834-7

Abstract

The recent availability of electronic health records (EHRs) has provided enormous opportunities to develop artificial intelligence (AI) algorithms. However, patient privacy has become a major concern that limits data sharing across hospital settings and subsequently hinders advances in AI. Synthetic data, which benefits from the development and proliferation of generative models, has served as a promising substitute for real patient EHR data. However, current generative models are limited in that they generate only a single type of clinical data for a synthetic patient, i.e., either continuous-valued or discrete-valued. To mimic the nature of clinical decision-making, which encompasses various data types and sources, in this study we propose a generative adversarial network (GAN) entitled EHR-M-GAN that simultaneously synthesizes mixed-type timeseries EHR data. EHR-M-GAN is capable of capturing the multidimensional, heterogeneous, and correlated temporal dynamics in patient trajectories. We have validated EHR-M-GAN on three publicly available intensive care unit databases with records from a total of 141,488 unique patients, and performed a privacy risk evaluation of the proposed model. EHR-M-GAN has demonstrated its superiority over state-of-the-art benchmarks for synthesizing clinical timeseries with high fidelity, while addressing the limitations regarding data types and dimensionality in current generative models. Notably, prediction models for outcomes of intensive care performed significantly better when the training data were augmented with EHR-M-GAN-generated timeseries. EHR-M-GAN may have use in developing AI algorithms in resource-limited settings, lowering the barrier for data acquisition while preserving patient privacy.
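To make "mixed-type timeseries" and "augmentation" concrete, the following is a minimal sketch, not the authors' implementation. It assumes each patient is represented by two aligned arrays, one continuous-valued and one discrete-valued, sharing a time axis. The function names, array shapes, and the random placeholder data are all hypothetical; in particular, the "synthetic" arrays here are random stand-ins, not output from EHR-M-GAN.

```python
import numpy as np

rng = np.random.default_rng(0)


def make_placeholder_patients(n_patients, n_steps=24, n_continuous=3, n_discrete=2):
    """Return placeholder mixed-type timeseries (random values, hypothetical shapes)."""
    # Continuous-valued channels (e.g. vital signs): (patients, time steps, features)
    continuous = rng.normal(size=(n_patients, n_steps, n_continuous))
    # Discrete-valued channels (e.g. binary intervention flags) on the same time axis
    discrete = rng.integers(0, 2, size=(n_patients, n_steps, n_discrete))
    return continuous, discrete


def augment(real, synthetic):
    """Concatenate real and synthetic patients along the patient axis, per data type."""
    return tuple(np.concatenate([r, s], axis=0) for r, s in zip(real, synthetic))


real = make_placeholder_patients(100)
synthetic = make_placeholder_patients(50)  # stand-in for a generator's output
aug_continuous, aug_discrete = augment(real, synthetic)
print(aug_continuous.shape, aug_discrete.shape)
```

A downstream prediction model would then be trained on the augmented arrays instead of the real ones alone; whether that helps depends on the fidelity of the synthetic data, which is what the paper evaluates.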

Journal: npj Digital Medicine
Publication Date: May 27, 2023
Citations: 29
License type: open-access
