A data-driven approach to choosing privacy parameters for clinical trial data sharing under differential privacy.

Henian Chen, Yayi Zhao, Spencer Giddens, Matthew J Valente, Joseph Ficek, Biwei Cao, Ellen Daley, Jinyong Pang

doi:10.1093/jamia/ocae038

Abstract

Clinical trial data sharing is crucial for promoting transparency and collaborative efforts in medical research. Differential privacy (DP) is a formal statistical technique for anonymizing shared data that balances the privacy of individual records against the accuracy of replicated results through a "privacy budget" parameter, ε. DP is considered the state of the art in privacy-protected data publication, yet it is underutilized in clinical trial data sharing. This study focuses on identifying ε values for sharing clinical trial data. We analyzed 2 clinical trial datasets with the privacy budget ε ranging from 0.01 to 10. Smaller values of ε entail adding greater amounts of random noise, which yields better privacy at the cost of accuracy. We compared rates, odds ratios, means, and mean differences between the original clinical trial datasets and the empirical distribution of the DP estimator. The DP rate closely approximated the original rate of 6.5% when ε > 1. The DP odds ratio closely aligned with the original odds ratio of 0.689 when ε ≥ 3. The DP mean closely approximated the original mean of 164.64 when ε ≥ 1. As ε increased to 5, both the minimum and maximum DP means converged toward the original mean. There is no consensus on how to choose the privacy budget ε: the definition of DP does not specify the required level of privacy, and there is no established formula for determining ε. Our findings suggest that applying DP holds promise for sharing clinical trial data.
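
To make the role of ε concrete, the following is a minimal sketch of how the empirical distribution of a DP rate might be examined across privacy budgets. The abstract does not name the noise mechanism, so the sketch assumes the Laplace mechanism applied to an event count with sensitivity 1 and a publicly known sample size. The trial size (1000) and event count (65) are hypothetical values chosen only so that the true rate equals the 6.5% reported above; the function names are illustrative and not taken from the paper.

```python
import numpy as np


def dp_rate_draws(events, n, epsilon, reps, rng):
    """Draw `reps` DP estimates of an event rate (assumed Laplace mechanism).

    Assumption: n is public and changing one record changes the event
    count by at most 1, so the count has sensitivity 1 and the Laplace
    noise scale is 1 / epsilon.
    """
    noise = rng.laplace(loc=0.0, scale=1.0 / epsilon, size=reps)
    # Clipping to [0, 1] is post-processing and does not weaken the guarantee.
    return np.clip((events + noise) / n, 0.0, 1.0)


def summarize(events, n, epsilons, reps=1000, seed=0):
    """Return {epsilon: (min, median, max)} of the DP rate over `reps` draws."""
    rng = np.random.default_rng(seed)
    summary = {}
    for eps in epsilons:
        draws = dp_rate_draws(events, n, eps, reps, rng)
        summary[eps] = (
            float(draws.min()),
            float(np.median(draws)),
            float(draws.max()),
        )
    return summary


if __name__ == "__main__":
    # Hypothetical trial: 65 events among 1000 participants (true rate 6.5%).
    for eps, (lo, med, hi) in summarize(65, 1000, [0.01, 0.1, 1, 3, 5, 10]).items():
        print(f"epsilon={eps:>5}: min={lo:.4f} median={med:.4f} max={hi:.4f}")
```

Under these assumptions, the spread between the minimum and maximum DP rate narrows as ε grows, which is the kind of behavior one would inspect when choosing ε for a given dataset. The ε at which the estimate becomes usable depends on the sample size, so the hypothetical numbers here should not be read as reproducing the paper's thresholds.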

More From: Journal of the American Medical Informatics Association : JAMIA

Similar Papers

Access to data from clinical trials in the COVID-19 crisis: open, flexible, and time-sensitive
Michael Ewers ... Nikolaus Plesnila
Journal of Clinical Epidemiology | VOL. 130
14 Oct 2020

Evaluating the Utility and Privacy of Synthetic Breast Cancer Clinical Trial Data Sets.
Samer El Kababji ... Ana-Alicia Beltran-Bless
JCO clinical cancer informatics | VOL. 7
01 Sep 2023

Optimizing the synthesis of clinical trial data using sequential trees.
Khaled El Emam ... Chaoyi Zheng
Journal of the American Medical Informatics Association : JAMIA | VOL. 28
13 Nov 2020

More transparency for clinical trial data: The decision by the European Medicines Agency to make clinical trial reports publicly available could provide a boon for biomedical research.
Philip Hunter
EMBO reports | VOL. 16
04 Dec 2014
