Preserving edits when perturbing microdata for statistical disclosure control

Natalie Shlomo,Ton De Waal

doi:10.3233/sju-2005-22207

Abstract

To protect individuals in microdata from the risk of re-identification, a general perturbative method called PRAM (the Post-Randomization Method) is sometimes used for masking records. This method adds “noise” to categorical variables by changing values of categories for a small number of records according to a prescribed probability matrix and a stochastic process based on the outcome of a random multinomial draw. Changing values of categorical variables, however, will cause fully edited and clean records in microdata to start failing edit constraints resulting in data of low utility. In addition, an inconsistent record pinpoints to a potential attacker that the record was perturbed and attempts can be made to unmask the data. Therefore, the perturbation process must take into account micro edit constraints which will ensure that perturbed microdata satisfy all edits. Macro edit constraints which take the form of information loss measures also need to be defined in order to ensure that the overall utility of the data will not be badly compromised given an acceptable level of disclosure risk. This paper will discuss methods for perturbing microdata using PRAM while minimizing micro and macro edit failures. (Updated 10th August 2005)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Preserving edits when perturbing microdata for statistical disclosure control

Abstract

Talk to us

Similar Papers

More From: Statistical Journal of the United Nations Economic Commission for Europe

Lead the way for us

Journal: Statistical Journal of the United Nations Economic Commission for Europe	Publication Date: Jan 9, 2006
Citations: 10

Similar Papers

K-Anonymous Microdata Release via Post Randomisation Method
Dai Ikarashi ... Koji Chida
-
Dai Ikarashi, et. al.Dai Ikarashi ... Koji Chida
01 Jan 2015
01 Jan 2015

Evaluating the Risk of Re-identification of Patients from Hospital Prescription Records.
Khaled El Emam ... Régis Vaillancourt
The Canadian Journal of Hospital Pharmacy | VOL. 62
Khaled El Emam, et. al.Khaled El Emam ... Régis Vaillancourt
23 Jul 2009
The Canadian Journal of Hospital Pharmacy | VOL. 62

Logistic Regression with Variables Subject to Post Randomization Method
Yong Ming Jeffrey Woo ... Aleksandra B Slavković
-
Yong Ming Jeffrey Woo, et. al.Yong Ming Jeffrey Woo ... Aleksandra B Slavković
01 Jan 2012
01 Jan 2012

The Analysis of Multivariate Misclassified Data With Special Attention to Randomized Response Data
Ardo Van Den Hout ... Peter G M Van Der Heijden
Sociological Methods & Research | VOL. 32
Ardo Van Den Hout, et. al.Ardo Van Den Hout ... Peter G M Van Der Heijden
01 Feb 2004
Sociological Methods & Research | VOL. 32

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Preserving edits when perturbing microdata for statistical disclosure control

Abstract

Talk to us

Similar Papers

More From: Statistical Journal of the United Nations Economic Commission for Europe