Privacy-preserving record linkage using Bloom filters

Rainer Schnell,Jörg Reiher,Tobias Bachteler

doi:10.1186/1472-6947-9-41

Rainer Schnell, Jörg Reiher + Show 1 more

Open Access

https://doi.org/10.1186/1472-6947-9-41

Copy DOI

Abstract

BackgroundCombining multiple databases with disjunctive or additional information on the same person is occurring increasingly throughout research. If unique identification numbers for these individuals are not available, probabilistic record linkage is used for the identification of matching record pairs. In many applications, identifiers have to be encrypted due to privacy concerns.MethodsA new protocol for privacy-preserving record linkage with encrypted identifiers allowing for errors in identifiers has been developed. The protocol is based on Bloom filters on q-grams of identifiers.ResultsTests on simulated and actual databases yield linkage results comparable to non-encrypted identifiers and superior to results from phonetic encodings.ConclusionWe proposed a protocol for privacy-preserving record linkage with encrypted identifiers allowing for errors in identifiers. Since the protocol can be easily enhanced and has a low computational burden, the protocol might be useful for many applications requiring privacy-preserving record linkage.

Highlights

Medical databases of people usually contain identifiers like surnames, given names, date of birth, and address information
There are some intriguing approaches proposed in the literature, these have a number of problems, for instance they involve very high computing demands or high rates of false positives or false negatives
We suggest the use of Bloom filters for solving this problem

Summary

Introduction

Since the identifiers agree exactly if their corresponding hash values agree, the third party can link matching records without knowing the identifiers. Variants of this protocol using exact matching have been published [12,13]. Combining multiple databases with disjunctive or additional information on the same person is occurring increasingly throughout medical research. The availability of large medical databases and unique person identifier (ID) numbers has made widespread use of record linkage possible. In many research applications not all databases contain a unique ID number In such situations, probabilistic record linkage is most frequently applied for the identification of matching record pairs [1]. We developed a new procedure which addresses these problems

Objectives

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Medical Informatics and Decision Making	Publication Date: Aug 25, 2009
Citations: 317	License type: CC BY 2.0

R Discovery Prime

R Discovery Prime

Privacy-preserving record linkage using Bloom filters

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Medical Informatics and Decision Making

Lead the way for us

Similar Papers

Scalable and approximate privacy-preserving record linkage

-

09 Dec 2014
09 Dec 2014

Secure Privacy Preserving Record Linkage of Large Databases by Modified Bloom Filter Encodings.
Rainer Schnell ... Christian Borgs
International journal of population data science | VOL. 1
Rainer Schnell, et. al.Rainer Schnell ... Christian Borgs
13 Apr 2017
International journal of population data science | VOL. 1

Evaluating privacy-preserving record linkage using cryptographic long-term keys and multibit trees on large medical datasets
Adrian P Brown ... Sean M Randall
BMC Medical Informatics and Decision Making | VOL. 17
Adrian P Brown, et. al.Adrian P Brown ... Sean M Randall
08 Jun 2017
BMC Medical Informatics and Decision Making | VOL. 17

Optimization of the Mainzelliste software for fast privacy-preserving record linkage
Florens Rohde ... Erhard Rahm
Journal of Translational Medicine | VOL. 19
Florens Rohde, et. al.Florens Rohde ... Erhard Rahm
15 Jan 2021
Journal of Translational Medicine | VOL. 19

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Privacy-preserving record linkage using Bloom filters

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Medical Informatics and Decision Making