Large language models generating synthetic clinical datasets: a feasibility and comparative analysis with real-world perioperative data

Austin A Barr,Joshua Quan,Eddie Guo,Emre Sezgin

doi:10.3389/frai.2025.1533508

Austin A Barr, Joshua Quan + Show 2 more

Open Access

https://doi.org/10.3389/frai.2025.1533508

Copy DOI

Export

Save

Cite

Journal: Frontiers in Artificial Intelligence	Publication Date: Feb 5, 2025
License type: CC BY 4.0

Abstract
Full-Text
Similar Papers

Abstract

Listen

BackgroundClinical data is instrumental to medical research, machine learning (ML) model development, and advancing surgical care, but access is often constrained by privacy regulations and missing data. Synthetic data offers a promising solution to preserve privacy while enabling broader data access. Recent advances in large language models (LLMs) provide an opportunity to generate synthetic data with reduced reliance on domain expertise, computational resources, and pre-training.ObjectiveThis study aims to assess the feasibility of generating realistic tabular clinical data with OpenAI’s GPT-4o using zero-shot prompting, and evaluate the fidelity of LLM-generated data by comparing its statistical properties to the Vital Signs DataBase (VitalDB), a real-world open-source perioperative dataset.MethodsIn Phase 1, GPT-4o was prompted to generate a dataset with qualitative descriptions of 13 clinical parameters. The resultant data was assessed for general errors, plausibility of outputs, and cross-verification of related parameters. In Phase 2, GPT-4o was prompted to generate a dataset using descriptive statistics of the VitalDB dataset. Fidelity was assessed using two-sample t-tests, two-sample proportion tests, and 95% confidence interval (CI) overlap.ResultsIn Phase 1, GPT-4o generated a complete and structured dataset comprising 6,166 case files. The dataset was plausible in range and correctly calculated body mass index for all case files based on respective heights and weights. Statistical comparison between the LLM-generated datasets and VitalDB revealed that Phase 2 data achieved significant fidelity. Phase 2 data demonstrated statistical similarity in 12/13 (92.31%) parameters, whereby no statistically significant differences were observed in 6/6 (100.0%) categorical/binary and 6/7 (85.71%) continuous parameters. Overlap of 95% CIs were observed in 6/7 (85.71%) continuous parameters.ConclusionZero-shot prompting with GPT-4o can generate realistic tabular synthetic datasets, which can replicate key statistical properties of real-world perioperative data. This study highlights the potential of LLMs as a novel and accessible modality for synthetic data generation, which may address critical barriers in clinical data access and eliminate the need for technical expertise, extensive computational resources, and pre-training. Further research is warranted to enhance fidelity and investigate the use of LLMs to amplify and augment datasets, preserve multivariate relationships, and train robust ML models.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

Large language models generating synthetic clinical datasets: a feasibility and comparative analysis with real-world perioperative data

Abstract

Published Version

Talk to us

Similar Papers

More From: Frontiers in Artificial Intelligence

Lead the way for us

Similar Papers

Enhancing biomechanical machine learning with limited data: generating realistic synthetic posture data using generative artificial intelligence.
Carlo Dindorf ... Frederike Werthmann
Frontiers in bioengineering and biotechnology | VOL. 12
Carlo Dindorf, et. al.Carlo Dindorf ... Frederike Werthmann
14 Feb 2024
Frontiers in bioengineering and biotechnology | VOL. 12

A New Benchmark on Machine Learning Methodologies for Hydrological Processes Modelling: A Comprehensive Review for Limitations and Future Research Directions
Zaher Mundher Yaseen
Knowledge-Based Engineering and Sciences | VOL. 4
Zaher Mundher YaseenZaher Mundher Yaseen
31 Dec 2024
Knowledge-Based Engineering and Sciences | VOL. 4

Federated learning: A cutting-edge survey of the latest advancements and applications
Azim Akhtarshenas ... David López-Pérez
Computer Communications | VOL. 228
Azim Akhtarshenas, et. al.Azim Akhtarshenas ... David López-Pérez
30 Sep 2024
Computer Communications | VOL. 228

Comparing human text classification performance and explainability with large language and machine learning models using eye-tracking
Jeevithashree Divya Venkatesh ... Gaurav Nanda
Scientific Reports | VOL. 14
Jeevithashree Divya Venkatesh, et. al.Jeevithashree Divya Venkatesh ... Gaurav Nanda
21 Jun 2024
Scientific Reports | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Large language models generating synthetic clinical datasets: a feasibility and comparative analysis with real-world perioperative data

Abstract

Published Version

Talk to us

Similar Papers

More From: Frontiers in Artificial Intelligence