A multiple imputation approach to disclosure limitation for high‐age individuals in longitudinal studies

Di An, Roderick J A Little, James W McNally

doi:10.1002/sim.3974

Abstract

Disclosure limitation is an important consideration in the release of public use data sets. It is particularly challenging for longitudinal data sets, since information about an individual accumulates with repeated measures over time. Research on disclosure limitation methods for longitudinal data has been very limited. We consider here problems created by high ages in cohort studies. Because of the risk of disclosure, ages of very old respondents often cannot be released; in particular, this is a specific stipulation of the Health Insurance Portability and Accountability Act (HIPAA) for the release of health data for individuals. Top-coding of individuals beyond a certain age is a standard way of dealing with this issue, and it may be adequate for cross-sectional data, where only a modest number of cases are affected. However, this approach leads to serious loss of information in longitudinal studies when individuals have been followed for many years. We propose and evaluate an alternative to top-coding for this situation based on multiple imputation (MI). This MI method is applied to survival analyses of simulated data and of data from the Charleston Heart Study (CHS), and is shown to work well in preserving the relationship between hazard and covariates.
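
The information loss from top-coding in a long follow-up can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the cutoff of 90, the ten baseline ages, and the helper names `top_code` and `share_top_coded` are hypothetical and do not come from the paper, and the sketch ignores deaths and dropout. It shows only top-coding, not the proposed MI method.

```python
# Illustrative sketch only: a hypothetical cohort and an assumed cutoff.
TOP_CODE = 90  # assumed cutoff; ages at or above it are released as 90


def top_code(age, cutoff=TOP_CODE):
    """Return the released age: the true age below the cutoff, else the cutoff."""
    return min(age, cutoff)


def share_top_coded(baseline_ages, years_followed, cutoff=TOP_CODE):
    """Fraction of respondents whose attained age reaches the cutoff.

    Attained age is baseline age plus years of follow-up; mortality and
    dropout are ignored, so this overstates the share in a real cohort.
    """
    attained = [age + years_followed for age in baseline_ages]
    return sum(age >= cutoff for age in attained) / len(attained)


# Hypothetical baseline ages for ten respondents (made up for illustration).
baseline = [55, 60, 62, 65, 68, 70, 72, 75, 80, 85]

# The share of top-coded records grows as follow-up lengthens.
for years in (0, 10, 20, 30):
    print(years, share_top_coded(baseline, years))
```

In a single cross-section of this toy cohort no record is affected, whereas after several decades of follow-up most attained ages would be released only as the cutoff value, which is the loss of information the abstract describes.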

Journal: Statistics in Medicine
Publication Date: Jun 15, 2010
Citations: 15

Similar Papers

A Distribution-Based Multiple Imputation Method for Handling Bivariate Pesticide Data with Values below the Limit of Detection
Haiying Chen ... Thomas A Arcury
19 Nov 2010
Environmental Health Perspectives | VOL. 119

A comparison of two methods for the estimation of precision with incomplete longitudinal data, jointly modelled with a time-to-event outcome.
G Touloumi ... A G Babiker
19 Sep 2003
Statistics in medicine | VOL. 22

Is using multiple imputation better than complete case analysis for estimating a prevalence (risk) difference in randomized controlled trials when binary outcome observations are missing?
Mavuto Mukaka ... Sarah A White
22 Jul 2016
Trials | VOL. 17

Weighted multiple imputation of ethnicity data that are missing not at random in primary care databases
Tra My Pham ... Irene Petersen
13 Apr 2017
International Journal of Population Data Science | VOL. 1
