Abstract

Abstract Notwithstanding the potential danger to security and privacy, sharing and publishing data has become usual in Data Science. To preserve privacy, de-identification methodologies guided by risk estimation have been designed. Two issues associated with classical risk metrics are, on the one hand, the adequacy of the metric and, on the other hand, its static nature. In this paper, we present metrics for estimating risk based on the emerging semantics provided by Formal Concept Analysis. The metrics are designed to estimate the a priori risk of compromised data deletion. Furthermore, by applying specialized variable forgetting methods for association rules, it is shown how to reflect the effect of deleting attributes belonging to potentially dangerous quasi-identifier sets. Additionally, a study of the role of the risk metric in confidence-based reasoning for re-identification is presented.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.