Practical data-oriented microaggregation for statistical disclosure control

J Domingo-Ferrer,J.M Mateo-Sanz

doi:10.1109/69.979982

Abstract

Microaggregation is a statistical disclosure control technique for microdata disseminated in statistical databases. Raw microdata (i.e., individual records or data vectors) are grouped into small aggregates prior to publication. Each aggregate should contain at least k data vectors to prevent disclosure of individual information, where k is a constant value preset by the data protector. No exact polynomial algorithms are known to date to microaggregate optimally, i.e., with minimal variability loss. Methods in the literature rank data and partition them into groups of fixed-size; in the multivariate case, ranking is performed by projecting data vectors onto a single axis. In this paper, candidate optimal solutions to the multivariate and univariate microaggregation problems are characterized. In the univariate case, two heuristics based on hierarchical clustering and genetic algorithms are introduced which are data-oriented in that they try to preserve natural data aggregates. In the multivariate case, fixed-size and hierarchical clustering microaggregation algorithms are presented which do not require data to be projected onto a single dimension; such methods clearly reduce variability loss as compared to conventional multivariate microaggregation on projected data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Practical data-oriented microaggregation for statistical disclosure control

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Knowledge and Data Engineering

Lead the way for us

Journal: IEEE Transactions on Knowledge and Data Engineering	Publication Date: Jan 1, 2002
Citations: 549

Similar Papers

On the complexity of optimal microaggregation for statistical disclosure control
Anna Oganian ... Josep Domingo-Ferrer
Statistical Journal of the United Nations Economic Commission for Europe | VOL. 18
Anna Oganian, et. al.Anna Oganian ... Josep Domingo-Ferrer
28 Dec 2001
Statistical Journal of the United Nations Economic Commission for Europe | VOL. 18

Security-control methods for statistical databases: a comparative study
Nabil R Adam ... John C Worthmann
ACM Computing Surveys | VOL. 21
Nabil R Adam, et. al.Nabil R Adam ... John C Worthmann
01 Dec 1989
ACM Computing Surveys | VOL. 21

A Pairwise-Systematic Microaggregation for Statistical Disclosure Control
Md Enamul Kabir ... Yanchun Zhang
-
Md Enamul Kabir, et. al.Md Enamul Kabir ... Yanchun Zhang
01 Dec 2010
01 Dec 2010

Initial application of ant colony optimisation to statistical disclosure control
Martin Serpell ... James Smith
-
Martin Serpell, et. al.Martin Serpell ... James Smith
06 Jul 2013
06 Jul 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Practical data-oriented microaggregation for statistical disclosure control

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Knowledge and Data Engineering