Abstract

ObjectiveIn the absence of accurate perinatal records, identifying and differentiating twins in administrative records is difficult because of their similarity; they share surnames, dates of birth, and residences. In administrative databases, events within twin pairs are often incorrectly identified as being from one individual, representing a threat to data accuracy. An inability to identify twins in administrative data precludes applying powerful twin-based causal inference methods to understand health and social well-being. ApproachWe developed an algorithm using linked administrative birth, perinatal, emergency department, and hospital records in Victoria, Australia from 1 January 1993 to 31 December 2023. We probabilistically linked a sample of 1,434 ‘known’ twin pairs from the Australian Twin Registry to validate the sensitivity of our algorithm. We calculated specificity using a sample of non-twins derived from linked data. ResultsOur algorithm identified 37,900 twin pairs, 75,800 twin individuals, in the Victorian linked dataset. The accuracy of our ascertainment of twins by key characteristics will be presented and discussed. ConclusionsThe birth cohort we have generated is one of the largest twin cohorts in the world with unprecedented granularity in longitudinal health and social data. ImplicationsOur algorithm improves the accuracy of administrative data repositories and linkage, providing novel information on where errors in previous linkages have occurred due to limited familial or intergenerational information. Our large population-based birth cohort of twins can be used to investigate the familial and non-familial determinants of health using advanced causal inference which leverages the natural similarity between twins.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.