Precise and Fast Cryptanalysis for Bloom Filter Based Privacy-Preserving Record Linkage

Peter Christen, Thilina Ranbaduge, Dinusha Vatsalan, Rainer Schnell

doi:10.1109/tkde.2018.2874004

Abstract

Being able to identify records that correspond to the same entity across diverse databases is an increasingly important step in many data analytics projects. Research into privacy-preserving record linkage (PPRL) aims to develop techniques that can link records across databases such that, besides the record pairs classified as matches, no sensitive information about the entities in these databases is revealed. A popular technique used in PPRL is to encode sensitive values into Bloom filters (bit vectors), which has the advantage of allowing approximate matching using character q-grams. PPRL based on Bloom filter encoding has been shown to be accurate and scalable to large databases, and is thus now being used in real-world PPRL systems in Australia, Canada, and the UK. However, recent studies have shown that Bloom filters used for PPRL are vulnerable to cryptanalysis attacks that can re-identify some of the sensitive values encoded in these Bloom filters. While previous such attack methods were slow and required knowledge of various encoding parameters, we present a novel efficient attack which exploits how attribute values are encoded into Bloom filters. Our attack method does not require knowledge of the encoding function used or of its parameter settings. It is able to correctly identify, with high precision, q-grams that could not have been hashed to certain Bloom filter bit positions, and using these identified q-grams it can then re-identify attribute values with high precision. Our method is significantly faster than earlier PPRL cryptanalysis attacks, and in our experimental evaluation, it is able to successfully re-identify attribute values from large real-world databases in a few minutes.
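As a rough illustration of the encoding the abstract refers to, the following Python sketch hashes the character q-grams of a value into a bit vector and compares two encodings with the Dice coefficient. Everything concrete in it is an assumption made for the example and is not taken from the paper: q = 2 without padding, a 64-bit filter, two hash functions combined by double hashing over SHA-1 and SHA-256, and all function names. The last function shows only the basic observation the abstract alludes to: if a bit is 0 in a record's filter, none of that record's q-grams can have been hashed to that position. The pairing of plaintext values and filters is handed to `excluded_qgrams` purely for illustration; an attacker would not have it, and the paper's actual attack procedure is not reproduced here.

```python
import hashlib


def qgrams(value, q=2):
    """Return the set of character q-grams of a string (no padding)."""
    return {value[i:i + q] for i in range(len(value) - q + 1)}


def positions(gram, num_bits, num_hashes):
    """Bit positions for one q-gram, using double hashing (assumed scheme)."""
    data = gram.encode("utf-8")
    h1 = int(hashlib.sha1(data).hexdigest(), 16)
    h2 = int(hashlib.sha256(data).hexdigest(), 16)
    return {(h1 + i * h2) % num_bits for i in range(num_hashes)}


def encode(value, num_bits=64, num_hashes=2):
    """Encode a string into a Bloom filter, returned as a list of 0/1 bits."""
    bits = [0] * num_bits
    for gram in qgrams(value):
        for pos in positions(gram, num_bits, num_hashes):
            bits[pos] = 1
    return bits


def dice(a, b):
    """Dice coefficient of two bit vectors of equal length."""
    common = sum(x & y for x, y in zip(a, b))
    total = sum(a) + sum(b)
    return 2.0 * common / total if total else 0.0


def excluded_qgrams(values, filters, pos):
    """q-grams that cannot have been hashed to bit position `pos`.

    A q-gram is excluded if it occurs in a value whose filter has a 0
    at `pos`. The value/filter pairing is assumed known here only to
    demonstrate the principle.
    """
    excluded = set()
    for value, bloom in zip(values, filters):
        if bloom[pos] == 0:
            excluded |= qgrams(value)
    return excluded


if __name__ == "__main__":
    names = ["peter", "petra", "rainer"]  # made-up example values
    filters = [encode(name) for name in names]
    print("dice(peter, petra):", dice(filters[0], filters[1]))
    print("dice(peter, rainer):", dice(filters[0], filters[2]))
    print("excluded at position 0:", sorted(excluded_qgrams(names, filters, 0)))
```

Values that share many q-grams set many of the same bits, which is what makes approximate matching on the encodings possible; the same regularity is what a cryptanalysis attack can exploit.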


Journal: IEEE Transactions on Knowledge and Data Engineering
Publication Date: Nov 1, 2019
Citations: 61

Similar Papers

Evaluating hardening techniques against cryptanalysis attacks on Bloom filter encodings for record linkage
Thilina Ranbaduge ... Rainer Schnell
International Journal of Population Data Science | VOL. 3
28 Aug 2018

Secure Privacy Preserving Record Linkage of Large Databases by Modified Bloom Filter Encodings
Rainer Schnell ... Christian Borgs
International Journal of Population Data Science | VOL. 1
13 Apr 2017

Efficient Cryptanalysis of Bloom Filters for Privacy-Preserving Record Linkage
Peter Christen ... Thilina Ranbaduge
01 Jan 2017

Scalable and approximate privacy-preserving record linkage
09 Dec 2014
