More Hybrid and Secure Protection of Statistical Datasets

Javier Herranz,Jordi Nin,Marc Sole

doi:10.1109/tdsc.2012.40

Abstract

Different methods and paradigms to protect data sets containing sensitive statistical information have been proposed and studied. The idea is to publish a perturbed version of the data set that does not leak confidential information, but that still allows users to obtain meaningful statistical values about the original data. The two main paradigms for data set protection are the classical one and the synthetic one. Recently, the possibility of combining the two paradigms, leading to a hybrid paradigm, has been considered. In this work, we first analyze the security of some synthetic and (partially) hybrid methods that have been proposed in the last years, and we conclude that they suffer from a high interval disclosure risk. We then propose the first fully hybrid SDC methods; unfortunately, they also suffer from a quite high interval disclosure risk. To mitigate this, we propose a postprocessing technique that can be applied to any data set protected with a synthetic method, with the goal of reducing its interval disclosure risk. We describe through the paper a set of experiments performed on reference data sets that support our claims.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

More Hybrid and Secure Protection of Statistical Datasets

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Dependable and Secure Computing

Lead the way for us

Similar Papers

Hybrid and Ensemble Methods of Two Days Ahead Forecasts of Electric Energy Production in a Small Wind Turbine
Paweł Piotrowski ... Marcin Kopyt
Energies | VOL. 14
Paweł Piotrowski, et. al.Paweł Piotrowski ... Marcin Kopyt
24 Feb 2021
Energies | VOL. 14

Bayesian Non-Parametric Generation Of Fully Synthetic Multivariate Categorical Data in the Presence of Structural Zeros
Daniel Manrique-Vallier ... Jingchen Hu
Journal of the Royal Statistical Society Series A: Statistics in Society | VOL. 181
Daniel Manrique-Vallier, et. al.Daniel Manrique-Vallier ... Jingchen Hu
09 Feb 2018
Journal of the Royal Statistical Society Series A: Statistics in Society | VOL. 181

Can pre-trained Transformers be used in detecting complex sensitive sentences? - A Monsanto case study
Roelien C Timmer ... David Liebowitz
-
Roelien C Timmer, et. al.Roelien C Timmer ... David Liebowitz
01 Dec 2021
01 Dec 2021

Empirical Evaluation of Mimic Software Project Data Sets for Software Effort Estimation
Maohua Gan ... Zeynep Yücel
IEICE Transactions on Information and Systems | VOL. E103.D
Maohua Gan, et. al.Maohua Gan ... Zeynep Yücel
01 Oct 2020
IEICE Transactions on Information and Systems | VOL. E103.D

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

More Hybrid and Secure Protection of Statistical Datasets

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Dependable and Secure Computing