Density-based microaggregation for statistical disclosure control

Jun-Lin Lin,Tsung-Hsien Wen,Jui-Chien Hsieh,Pei-Chann Chang

doi:10.1016/j.eswa.2009.09.054

Abstract

Protection of personal data in statistical databases has recently become a major societal concern. Statistical disclosure control (SDC) is often applied to statistical databases before they are released for public use. Microaggregation for SDC is a family of methods to protect microdata (i.e., records on individuals and/or companies) from individual identification. Microaggregation works by partitioning the microdata into groups of at least k records and, then, replacing the records in each group with the centroid of the group. An optimal microaggregation method must minimize the information loss resulting from this replacement process. However, this problem of minimizing information loss has been shown to be NP-hard for multivariate data. Methods based on various heuristics have been proposed for this problem, but none performs the best for every microdata set and various k values. This work presents a density-based algorithm (DBA) for microaggregation. The DBA first forms groups of records by the descending order of their densities, then fine-tunes these groups in reverse order. The performance of the DBA is compared against the latest microaggregation methods. Experimental results indicate that DBA incurs the least information loss for over half of the test situations.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Density-based microaggregation for statistical disclosure control

Abstract

Talk to us

Similar Papers

More From: Expert Systems with Applications

Lead the way for us

Journal: Expert Systems with Applications	Publication Date: Sep 27, 2009
Citations: 57

Similar Papers

Novel Iterative Min-Max Clustering to Minimize Information Loss in Statistical Disclosure Control
Abdun Naser Mahmood ... Abdul K Mustafa
-
Abdun Naser Mahmood, et. al.Abdun Naser Mahmood ... Abdul K Mustafa
01 Jan 2015
01 Jan 2015

A Pairwise-Systematic Microaggregation for Statistical Disclosure Control
Md Enamul Kabir ... Yanchun Zhang
-
Md Enamul Kabir, et. al.Md Enamul Kabir ... Yanchun Zhang
01 Dec 2010
01 Dec 2010

New Multi-dimensional Sorting Based K-Anonymity Microaggregation for Statistical Disclosure Control
Abdun Naser Mahmood ... Md Enamul Kabir
-
Abdun Naser Mahmood, et. al.Abdun Naser Mahmood ... Md Enamul Kabir
01 Jan 2013
01 Jan 2013

Microaggregation Sorting Framework for K-Anonymity Statistical Disclosure Control in Cloud Computing
Md Enamul Kabir ... Abdul K Mustafa
IEEE Transactions on Cloud Computing | VOL. 8
Md Enamul Kabir, et. al.Md Enamul Kabir ... Abdul K Mustafa
01 Apr 2020
IEEE Transactions on Cloud Computing | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Density-based microaggregation for statistical disclosure control

Abstract

Talk to us

Similar Papers

More From: Expert Systems with Applications