Abstract

BackgroundWe assessed the linkage and correct linkage rate using deterministic record linkage among three commonly used Canadian databases, namely, the population registry, hospital discharge data and Vital Statistics registry.MethodsThree combinations of four personal identifiers (surname, first name, sex and date of birth) were used to determine the optimal combination. The correct linkage rate was assessed using a unique personal health number available in all three databases.ResultsAmong the three combinations, the combination of surname, sex, and date of birth had the highest linkage rate of 88.0% and 93.1%, and the second highest correct linkage rate of 96.9% and 98.9% between the population registry and Vital Statistics registry, and between the hospital discharge data and Vital Statistics registry in 2001, respectively. Adding the first name to the combination of the three identifiers above increased correct linkage by less than 1%, but at the cost of lowering the linkage rate almost by 10%.ConclusionOur findings suggest that the combination of surname, sex and date of birth appears to be optimal using deterministic linkage. The linkage and correct linkage rates appear to vary by age and the type of database, but not by sex.

Highlights

  • We assessed the linkage and correct linkage rate using deterministic record linkage among three commonly used Canadian databases, namely, the population registry, hospital discharge data and Vital Statistics registry

  • In Vital Statistics individuals were defined as Calgary Health Region (CHR) residents based on Standard Geographical Classification (SGC) for assessment of the linkage between Vital Statistics and the population registry

  • Linkage and correct linkage rates between vital statistics and population registry Table 1 presents the percentage of deaths in the Vital Statistics registry which can be linked with the population registry

Read more

Summary

Introduction

We assessed the linkage and correct linkage rate using deterministic record linkage among three commonly used Canadian databases, namely, the population registry, hospital discharge data and Vital Statistics registry. 2) What is the correct linkage rate among the records that are linked?. The evolution from manual to computerized record linkage has helped to answer the first question. There are two commonly used computerized record linkage approaches: deterministic and probabilistic [8,9]. The deterministic record linkage approach generates links on the basis of a full agreement of a unique identifier or a set of common identifiers. This method minimizes the uncertainties in the match between two databases since only a complete match on a set of personal variables is accepted at the cost of lowering the linkage rate.

Methods
Results
Discussion
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.