Abstract

Privacy-preserving data mining is a novel research direction in data mining and statistical databases, where data mining algorithms are analyzed for the side effects they incur in data privacy. For example, through data mining, one is able to infer sensitive information, including personal information or even patterns, from nonsensitive information or unclassified data. There have been two types of privacy concerning data mining. The first type of privacy is that the data is altered so that the mining result will preserve certain privacy. The second type of privacy is that the data is manipulated so that the mining result is not affected or minimally affected. Given specific rules to be hidden, many data altering techniques for hiding association, classification and clustering rules have been proposed. However, to specify hidden rules, entire data mining process needs to be executed. For some applications, we are only interested in hiding certain sensitive predicative rules that contain given items. In this work, we assume that only sensitive items are given and propose two algorithms, ISL (Increase Support of LHS) and DSR (Decrease Support of RHS), to modify data in database so that sensitive predicative rules containing specified items on the left hand side of rule cannot be inferred through association rule mining. Examples illustrating the proposed algorithms are given. The characteristics of the algorithms are analyzed. The efficiency of the proposed approach is further compared with Verykios etc. [2001, 2004] approach. It is shown that our approach required less number of databases scanning and prune more number of hidden rules. However, our approach must hide all rules containing the hidden items on the left hand side, where Verykios etc approach can hide any specific rule.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.