Validity of Privacy-Protecting Analytical Methods That Use Only Aggregate-Level Information to Conduct Multivariable-Adjusted Analysis in Distributed Data Networks.

Xiaojuan Li,Jeffrey R Curtis,David P Fisher,Sengwee Toh,Marsha A Raebel,Érick Moyneur,David E Arterburn,Lindsay Lagreid,W Benjamin Nowell,Bruce H Fireman,Mia Gallagher

doi:10.1093/aje/kwy265

Abstract

Distributed data networks enable large-scale epidemiologic studies, but protecting privacy while adequately adjusting for a large number of covariates continues to pose methodological challenges. Using 2 empirical examples within a 3-site distributed data network, we tested combinations of 3 aggregate-level data-sharing approaches (risk-set, summary-table, and effect-estimate), 4 confounding adjustment methods (matching, stratification, inverse probability weighting, and matching weighting), and 2 summary scores (propensity score and disease risk score) for binary and time-to-event outcomes. We assessed the performance of combinations of these data-sharing and adjustment methods by comparing their results with results from the corresponding pooled individual-level data analysis (reference analysis). For both types of outcomes, the method combinations examined yielded results identical or comparable to the reference results in most scenarios. Within each data-sharing approach, comparability between aggregate- and individual-level data analysis depended on adjustment method; for example, risk-set data-sharing with matched or stratified analysis of summary scores produced identical results, while weighted analysis showed some discrepancies. Across the adjustment methods examined, risk-set data-sharing generally performed better, while summary-table and effect-estimate data-sharing more often produced discrepancies in settings with rare outcomes and small sample sizes. Valid multivariable-adjusted analysis can be performed in distributed data networks without sharing of individual-level data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Validity of Privacy-Protecting Analytical Methods That Use Only Aggregate-Level Information to Conduct Multivariable-Adjusted Analysis in Distributed Data Networks.

Abstract

Talk to us

Similar Papers

More From: American Journal of Epidemiology

Lead the way for us

Journal: American Journal of Epidemiology	Publication Date: Dec 7, 2018
Citations: 30

Similar Papers

Comparison of privacy-protecting analytic and data-sharing methods: A simulation study.
Kazuki Yoshida ... Sengwee Toh
Pharmacoepidemiology and Drug Safety | VOL. 27
Kazuki Yoshida, et. al.Kazuki Yoshida ... Sengwee Toh
18 Jul 2018
Pharmacoepidemiology and Drug Safety | VOL. 27

PpmHR: A Privacy-protecting Tool to Fit Inverse Probability Weighted Cox Models in Multisite Studies.
Di Shu ... Sengwee Toh
Epidemiology | VOL. 32
Di Shu, et. al.Di Shu ... Sengwee Toh
05 Nov 2020
Epidemiology | VOL. 32

DataSHIELD: resolving a conflict in contemporary bioscience--performing a pooled analysis of individual-level data without sharing the data
M Wolfson ... J Macleod
International Journal of Epidemiology | VOL. 39
M Wolfson, et. al.M Wolfson ... J Macleod
14 Jul 2010
International Journal of Epidemiology | VOL. 39

Covariate balance-related propensity score weighting in estimating overall hazard ratio with distributed survival data
Chen Huang ... Guoyou Qin
BMC Medical Research Methodology | VOL. 23
Chen Huang, et. al.Chen Huang ... Guoyou Qin
13 Oct 2023
BMC Medical Research Methodology | VOL. 23

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Validity of Privacy-Protecting Analytical Methods That Use Only Aggregate-Level Information to Conduct Multivariable-Adjusted Analysis in Distributed Data Networks.

Abstract

Talk to us

Similar Papers

More From: American Journal of Epidemiology