Research Paper: Process Mining and Synthetic Health Data: Reflections and Lessons Learnt

Alistair Bullward,Abdulaziz Aljebreen,Ciarán Mcinerney,Alexander Coles,Owen Johnson

doi:10.1007/978-3-031-27815-0_25

Abstract

AbstractAnalysing the treatment pathways in real-world health data can provide valuable insight for clinicians and decision-makers. However, the procedures for acquiring real-world data for research can be restrictive, time-consuming and risks disclosing identifiable information. Synthetic data might enable representative analysis without direct access to sensitive data. In the first part of our paper, we propose an approach for grading synthetic data for process analysis based on its fidelity to relationships found in real-world data. In the second part, we apply our grading approach by assessing cancer patient pathways in a synthetic healthcare dataset (The Simulacrum provided by the English National Cancer Registration and Analysis Service) using process mining. Visualisations of the patient pathways within the synthetic data appear plausible, showing relationships between events confirmed in the underlying non-synthetic data. Data quality issues are also present within the synthetic data which reflect real-world problems and artefacts from the synthetic dataset’s creation. Process mining of synthetic data in healthcare is an emerging field with novel challenges. We conclude that researchers should be aware of the risks when extrapolating results produced from research on synthetic data to real-world scenarios and assess findings with analysts who are able to view the underlying data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Research Paper: Process Mining and Synthetic Health Data: Reflections and Lessons Learnt

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Creating High-Quality Synthetic Health Data: Framework for Model Development and Validation.
Elnaz Karimian Sichani ... Khaled El Emam
JMIR formative research | VOL. 8
Elnaz Karimian Sichani, et. al.Elnaz Karimian Sichani ... Khaled El Emam
02 Oct 2023
JMIR formative research | VOL. 8

An evaluation of the replicability of analyses using synthetic health data
Khaled El Emam ... Alaa El-Hussuna
Scientific Reports | VOL. 14
Khaled El Emam, et. al.Khaled El Emam ... Alaa El-Hussuna
24 Mar 2024
Scientific Reports | VOL. 14

Enriching Data Science and Health Care Education: Application and Impact of Synthetic Data Sets Through the Health Gym Project.
Nicholas I-Hsien Kuo ... Sanja Lujic
JMIR medical education | VOL. 10
Nicholas I-Hsien Kuo, et. al.Nicholas I-Hsien Kuo ... Sanja Lujic
16 Jan 2024
JMIR medical education | VOL. 10

Characterization of Synthetic Health Data Using Rule-Based Artificial Intelligence Models.
Marta Lenatti ... Maurizio Mongelli
IEEE Journal of Biomedical and Health Informatics | VOL. 27
Marta Lenatti, et. al.Marta Lenatti ... Maurizio Mongelli
01 Aug 2023
IEEE Journal of Biomedical and Health Informatics | VOL. 27

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Research Paper: Process Mining and Synthetic Health Data: Reflections and Lessons Learnt

Abstract

Talk to us

Similar Papers