Abstract
BackgroundWe assessed the linkage and correct linkage rate using deterministic record linkage among three commonly used Canadian databases, namely, the population registry, hospital discharge data and Vital Statistics registry.MethodsThree combinations of four personal identifiers (surname, first name, sex and date of birth) were used to determine the optimal combination. The correct linkage rate was assessed using a unique personal health number available in all three databases.ResultsAmong the three combinations, the combination of surname, sex, and date of birth had the highest linkage rate of 88.0% and 93.1%, and the second highest correct linkage rate of 96.9% and 98.9% between the population registry and Vital Statistics registry, and between the hospital discharge data and Vital Statistics registry in 2001, respectively. Adding the first name to the combination of the three identifiers above increased correct linkage by less than 1%, but at the cost of lowering the linkage rate almost by 10%.ConclusionOur findings suggest that the combination of surname, sex and date of birth appears to be optimal using deterministic linkage. The linkage and correct linkage rates appear to vary by age and the type of database, but not by sex.
Highlights
We assessed the linkage and correct linkage rate using deterministic record linkage among three commonly used Canadian databases, namely, the population registry, hospital discharge data and Vital Statistics registry
In Vital Statistics individuals were defined as Calgary Health Region (CHR) residents based on Standard Geographical Classification (SGC) for assessment of the linkage between Vital Statistics and the population registry
Linkage and correct linkage rates between vital statistics and population registry Table 1 presents the percentage of deaths in the Vital Statistics registry which can be linked with the population registry
Summary
We assessed the linkage and correct linkage rate using deterministic record linkage among three commonly used Canadian databases, namely, the population registry, hospital discharge data and Vital Statistics registry. 2) What is the correct linkage rate among the records that are linked?. The evolution from manual to computerized record linkage has helped to answer the first question. There are two commonly used computerized record linkage approaches: deterministic and probabilistic [8,9]. The deterministic record linkage approach generates links on the basis of a full agreement of a unique identifier or a set of common identifiers. This method minimizes the uncertainties in the match between two databases since only a complete match on a set of personal variables is accepted at the cost of lowering the linkage rate.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.