Unsupervised Deep Learning Algorithm Research Articles

OBJECTIVES The dynamic and evolving clinical trial landscape, with its nascent incorporation of real-world data, has the potential to transform healthcare. With these developments come challenges in protecting patient privacy while retaining the information in the data and encouraging transparency and reproducibility of methods. An emerging technology to address these challenges is synthetic data generation (SDG). The promise of SDG is to provide realistic, representative, and sharable data that retain all the potential learning of the original (parent) data. We demonstrate a flexible framework for generating and evaluating synthetic data from a challenging clinical use case of chronic lymphocytic leukemia (CLL) patients. We demonstrate the potential to generate a synthetic cohort using real-world data on a cohort of patients treated with chimeric antigen receptor T-cell (CAR T) therapy. Many analytes, and their longitudinal progression, have been shown to predict clinical outcomes in CLL. Therefore, when generating synthetic data for this cohort, it is essential to preserve these longitudinal analyte 'fingerprints' together with the associated clinical information in order to capture latent disease progression adequately.

METHODS We used Generative Adversarial Networks (GANs), a type of unsupervised deep learning algorithm, to generate synthetic CLL patients and their latent disease progression over time. Using ICD-9/10 codes, we identified 389 patients within a large tertiary healthcare system (providing care to approximately 5 million patients) who could provide longitudinal values for the synthetic cohort. We simulated synthetic patient data using EMR (Electronic Medical Record) data, including laboratory test values, patient-reported health state utility values (HSUVs), and other baseline characteristics.

RESULTS Clinical attributes showed a strong relation between analyte trajectories and outcomes. Synthetic data were indistinguishable from the original data both in statistical tests and in the performance of machine learning algorithms trained to predict disease progression and worsening outcomes. The Wasserstein Conditional GAN outperformed the vanilla GAN, the conditional GAN, and the Wasserstein GAN. Synthetic patient data generated by the GAN accurately reflect the means, standard deviations, and correlations of each variable over time, to the extent that a logistic regression cannot distinguish synthetic data from actual data. Moreover, our unsupervised model predicts changes in total HSUVs with the same accuracy as specifically trained supervised models, while additionally capturing the correlation structure of the covariates.

LIMITATIONS and CONCLUSIONS Many synthetic data generation methods emphasize retention of the relationships among data elements and may preclude certain data anomalies. The real-world data may retain properties associated with the experiment or data generation process and carry them over into the synthetic cohort. The ideal synthetic cohort would support any statistical discovery possible in, and be verifiable against, the parent dataset, while reducing the probability of patient identification to zero. This application of statistical tests to evaluate deep learning algorithms provides a novel perspective on synthetic data generation and lays the groundwork for establishing best practices for synthetic data quality assessment.
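The logistic-regression check mentioned in the results can be illustrated with a minimal sketch. This is not the authors' code: it assumes the check works as a real-versus-synthetic classifier whose held-out accuracy is compared with chance (0.5), and it uses only the Python standard library. The function names (`discriminator_accuracy`, `toy_cohort`, and the helpers) and the two-feature Gaussian toy data are hypothetical and stand in for the patient variables.

```python
import math
import random
import statistics


def sigmoid(z):
    # Clamp z so math.exp cannot overflow on extreme inputs.
    z = max(-30.0, min(30.0, z))
    return 1.0 / (1.0 + math.exp(-z))


def standardize(rows):
    """Z-score every column using pooled (label-free) statistics."""
    cols = list(zip(*rows))
    means = [statistics.mean(c) for c in cols]
    stds = [statistics.pstdev(c) or 1.0 for c in cols]  # guard constant columns
    return [[(v - m) / s for v, m, s in zip(r, means, stds)] for r in rows]


def fit_logistic(X, y, lr=0.1, epochs=300):
    """Fit logistic regression by batch gradient descent; returns (weights, bias)."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            for j in range(d):
                grad_w[j] += err * xi[j]
            grad_b += err
        w = [wj - lr * gj / n for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def accuracy(w, b, X, y):
    """Fraction of rows whose predicted label (score > 0 means 'real') is correct."""
    hits = 0
    for xi, yi in zip(X, y):
        score = sum(wj * xj for wj, xj in zip(w, xi)) + b
        hits += int((score > 0) == (yi == 1))
    return hits / len(y)


def discriminator_accuracy(real, synthetic, seed=0, test_fraction=0.3):
    """Held-out accuracy of a real-vs-synthetic classifier.

    Values near 0.5 mean the classifier cannot tell the cohorts apart;
    values near 1.0 mean the synthetic rows are easy to spot.
    """
    features = standardize(list(real) + list(synthetic))
    labels = [1] * len(real) + [0] * len(synthetic)  # 1 = real, 0 = synthetic
    rows = list(zip(features, labels))
    random.Random(seed).shuffle(rows)
    cut = int(len(rows) * (1 - test_fraction))
    train, test = rows[:cut], rows[cut:]
    w, b = fit_logistic([x for x, _ in train], [t for _, t in train])
    return accuracy(w, b, [x for x, _ in test], [t for _, t in test])


def toy_cohort(n, mean, rng):
    """Hypothetical stand-in for patient rows: two Gaussian features."""
    return [[rng.gauss(mean, 1.0), rng.gauss(mean, 1.0)] for _ in range(n)]


if __name__ == "__main__":
    rng = random.Random(42)
    real = toy_cohort(200, 0.0, rng)
    good_synth = toy_cohort(200, 0.0, rng)  # same distribution as real
    bad_synth = toy_cohort(200, 3.0, rng)   # visibly shifted distribution
    print("faithful synthetic:", discriminator_accuracy(real, good_synth))
    print("shifted synthetic: ", discriminator_accuracy(real, bad_synth))
```

Under these assumptions, a faithful synthetic cohort should score close to 0.5 and a shifted one close to 1.0. Accuracy on a held-out split is only one possible summary; other metrics, such as an area under the ROC curve, could be substituted without changing the idea.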

Articles published on Unsupervised Deep Learning Algorithm

Scan-Specific Unsupervised Highly Accelerated Non-Cartesian CEST Imaging Using Implicit Neural Representation and Explicit Sparse Prior.

Unsupervised deep metric learning algorithm for crop disease images based on knowledge distillation networks

Attention non-negative spectral clustering

Anomaly detection in automated fibre placement: learning with data limitations

A novel data-driven relationship inference approach for automatic data tagging in building heating, ventilation and air conditioning systems

SurgT challenge: Benchmark of soft-tissue trackers for robotic surgery

An unsupervised deep learning algorithm for single-site reconstruction in quantum gas microscopes

Unsupervised Deep Learning for Structural Health Monitoring

Unsupervised TCN-AE-Based Outlier Detection for Time Series With Seasonality and Trend for Cellular Networks

Regimentation of geochemical indicator elements employing convolutional deep learning algorithm

Novelty detection on a laboratory benchmark slender structure using an unsupervised deep learning algorithm

Motion Compensated Unsupervised Deep Learning for 5D MRI.

Burning Skin Detection System in Human Body

Systematic Evaluation of Synthetic Panel Data Quality with an Application to Chronic Lymphocytic Leukemia

Proposal of a new method for learning of diesel generator sounds and detecting abnormal sounds using an unsupervised deep learning algorithm

Attention‐based vector quantisation variational autoencoder for colour‐patterned fabrics defect detection

Attention-based Feature Fusion Generative Adversarial Network for yarn-dyed fabric defect detection

A New Unsupervised Deep Learning Algorithm for Fine-Grained Detection of Driver Distraction

An unsupervised machine learning approach using passive movement data to understand depression and schizophrenia

Learning Representations Using RNN Encoder-Decoder for Edge Security Control.

