Statistical Disclosure Control for Micro-Data Using the R Package sdcMicro

Matthias Templ, Alexander Kowarik, Bernhard Meindl

doi:10.18637/jss.v067.i04

Open Access

https://doi.org/10.18637/jss.v067.i04

Journal: Journal of Statistical Software	Publication Date: Jan 1, 2015
Citations: 56	License type: cc-by

Affiliation: TU Wien, Statistics Austria

Abstract

The demand for data from surveys, censuses or registers containing sensitive information on people or enterprises has increased significantly in recent years. However, before data can be provided to the public or to researchers, confidentiality has to be respected for any data set that may contain sensitive information about individual units. Confidentiality can be achieved by applying statistical disclosure control (SDC) methods to the data in order to reduce the disclosure risk. The R package sdcMicro serves as an easy-to-handle, object-oriented S4 class implementation of SDC methods to evaluate and anonymize confidential micro-data sets. It includes all popular disclosure risk and perturbation methods. The package automatically recalculates frequency counts, individual and global risk measures, information loss and data utility statistics after each anonymization step. All methods are highly optimized in terms of computational cost so that they can work with large data sets. Reporting facilities that summarize the anonymization process can also be easily used by practitioners. We describe the package and demonstrate its functionality with a complex household survey test data set that has been distributed by the International Household Survey Network.
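
The idea of recalculating frequency counts after each anonymization step can be illustrated with a small, language-neutral sketch. The Python code below is not the sdcMicro API and is not taken from the paper; the toy records, the key variables (`region`, `sex`, `age`) and the helper names are all hypothetical. It counts how many records share each combination of key-variable values, flags records that occur fewer than k times, and repeats the count after one simple recoding step.

```python
from collections import Counter

def frequency_counts(records, key_vars):
    """Count how many records share each combination of key-variable values."""
    return Counter(tuple(r[v] for v in key_vars) for r in records)

def violates_k_anonymity(records, key_vars, k):
    """Return the records whose key combination occurs fewer than k times."""
    counts = frequency_counts(records, key_vars)
    return [r for r in records if counts[tuple(r[v] for v in key_vars)] < k]

def recode_age(record):
    """Hypothetical anonymization step: replace exact age by a 10-year band."""
    decade = record["age"] // 10 * 10
    return {**record, "age": f"{decade}-{decade + 9}"}

# Hypothetical toy micro-data (assumption, for illustration only).
records = [
    {"region": "urban", "sex": "f", "age": 34},
    {"region": "urban", "sex": "f", "age": 37},
    {"region": "urban", "sex": "m", "age": 52},
    {"region": "rural", "sex": "m", "age": 58},
]
key_vars = ["region", "sex", "age"]

# Risk before anonymization: records that are unique on the key variables.
before = violates_k_anonymity(records, key_vars, k=2)

# Apply one anonymization step, then recalculate the frequency counts.
recoded = [recode_age(r) for r in records]
after = violates_k_anonymity(recoded, key_vars, k=2)

print(len(before), len(after))
```

In this toy example, recoding age into bands reduces the number of records that are unique on the key variables, at the cost of losing the exact ages. This is the kind of trade-off between risk and information loss that the abstract says the package tracks automatically.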

Similar Papers

Providing Data With High Utility And No Disclosure Risk For The Public and Researchers: An Evaluation By Advanced Statistical Disclosure Risk Methods
Matthias Templ
Austrian Journal of Statistics | VOL. 43
13 Jun 2014

Measuring Disclosure Risk and Data Utility for Flexible Table Generators
Natalie Shlomo ... Laszlo Antal
Journal of Official Statistics | VOL. 31
01 Jun 2015

Disclosure risk reduction for generalized linear model output in a remote analysis system
Atikur R Khan ... Christine M O'Keefe
Data & Knowledge Engineering | VOL. 111
31 Jul 2017

Statistical Disclosure Control Methods for Census Frequency Tables
Natalie Shlomo
International Statistical Review | VOL. 75
15 Jun 2007
