Abstract

Little attention has been paid to the measurement of risk to privacy in Database Management Systems, despite their prevalence as a modality of data access. This paper proposes PriDe, a quantitative privacy metric that provides a measure (privacy score) of privacy risk when executing queries in relational database management systems. PriDe measures the degree to which attribute values, retrieved by a principal (user) engaging in an interactive query session, represent a reduction of privacy with respect to the attribute values previously retrieved by the principal. It can be deployed in interactive query settings where the user sends SQL queries to the database and gets results at run-time and provides privacy-conscious organizations with a way to monitor the usage of the application data made available to third parties in terms of privacy. The proposed approach, without loss of generality, is applicable to BigSQL-style technologies. Additionally, the paper proposes a privacy equivalence relation that facilitates the computation of the privacy score.

Highlights

  • The recent past has witnessed an exponential increase in the amount of data being collected by contemporary organizations

  • It is a measure of the degree to which attribute values, retrieved by a principal engaged in an interactive query session, may represent a reduction of privacy with respect to the attribute values previously retrieved by the principal from the Relational Database Management System (RDBMS)

  • We introduced Privacy Distance (PriDe), which computes a privacy score within the framework of a Relational Database Management System (RDBMS)

Read more

Summary

INTRODUCTION

The recent past has witnessed an exponential increase in the amount of data being collected by contemporary organizations. The contribution of this paper is the construction of a metric that objectively measures privacy risks by providing data curators with a score within the RDBMS framework. PriDe computes a run-time score of the privacy risk that arises from a query to the database It is a measure of the degree to which attribute values, retrieved by a principal engaged in an interactive query session, may represent a reduction of privacy with respect to the attribute values previously retrieved by the principal from the RDBMS. This paper is a significant revision of the paper (Khan et al, 2019c) and extends this work, with the treatment of cold-start (absence of baseline profile - past querying behavior), experimental results pertaining to the evaluation of the cold-start scenario and includes discussions, for instance, on the a use-case of computing privacy score.

RELATED WORK
Measuring Privacy
PRIDE—THE PRIVACY SCORE MODEL
Modeling Querying Behavior
Comparing Profiles
Distance Between N-Grams
Privacy Equivalence Between Attributes
Defining Privacy Equivalence Relation
Cumulative Privacy Score
COMPUTING PRIVACY SCORE
Use-Case
Modeling Querying Behavior Using Machine Learning
Application in Detection of SQL Injections
CONCLUSIONS
DATA AVAILABILITY STATEMENT
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.