Hiding Association Rules by Using Confidence and Support

Elena Dasseni,Ahmed K Elmagarmid,Vassilios S Verykios,Elisa Bertino

doi:10.1007/3-540-45496-9_27

Elena Dasseni, Ahmed K Elmagarmid + Show 2 more

Open Access

https://doi.org/10.1007/3-540-45496-9_27

Copy DOI

Abstract

Large repositories of data contain sensitive information which must be protected against unauthorized access. Recent advances, in data mining and machine learning algorithms, have increased the disclosure risks one may encounter when releasing data to outside parties. A key problem, and still not sufficiently investigated, is the need to balance the confidentiality of the disclosed data with the legitimate needs of the data users. Every disclosure limitation method affects, in some way, and modifies true data values and relationships. In this paper, we investigate confidentiality issues of a broad category of rules, which are called association rules. If the disclosure risk of some of these rules are above a certain privacy threshold, those rules must be characterized as sensitive. Sometimes, sensitive rules should not be disclosed to the public since, among other things, they may be used for inferencing sensitive data, or they may provide business competitors with an advantage.

Full Text