Abstract
BackgroundThe genealogical histories of individuals within populations are of interest to studies aiming both to uncover detailed pedigree information and overall quantitative population demographic histories. However, the analysis of quantitative details of individual genealogical histories has faced challenges from incomplete available pedigree records and an absence of objective and quantitative details in pedigree information. Although complete pedigree information for most individuals is difficult to track beyond a few generations, it is possible to describe a person’s genealogical history using their genetic relatives revealed by identity by descent (IBD) segments—long genomic segments shared by two individuals within a population, which are identical due to inheritance from common ancestors. When modern biobanks collect genotype information for a significant fraction of a population, dense genetic connections of a person can be traced using such IBD segments, offering opportunities to characterize individuals in the context of the underlying populations. Here, we conducted an individual-centric analysis of IBD segments among the UK Biobank participants that represent 0.7% of the UK population.ResultsWe made a high-quality call set of IBD segments over 5 cM among all 500,000 UK Biobank participants. On average, one UK individual shares IBD segments with 14,000 UK Biobank participants, which we refer to as “relatives.” Using these segments, approximately 80% of a person’s genome can be imputed. We subsequently propose genealogical descriptors based on the genetic connections of relative cohorts of individuals sharing at least one IBD segment and show that such descriptors offer important information about one’s genetic makeup, personal genealogical history, and social behavior. Through analysis of relative counts sharing segments at different lengths, we identified a group, potentially British Jews, who has a distinct pattern of familial expansion history. Finally, using the enrichment of relatives in one’s neighborhood, we identified regional variations of personal preference favoring living closer to one’s extended families.ConclusionsOur analysis revealed genetic makeup, personal genealogical history, and social behaviors at the population scale, opening possibilities for further studies of individual’s genetic connections in biobank data.
Highlights
The genealogical histories of individuals within populations are of interest to studies aiming both to uncover detailed pedigree information and overall quantitative population demographic histories
This translates to, on average, each person having 5 cM genetic connections to 14,000, or about 3% of UKBB persons, whom we call genetic relatives or relatives for short. These segments offer on average 10x coverage of the overall diploid genome for a UK individual. Noting that this density of identity by descent (IBD) segments is higher than that reported by existing studies [10], we took extra caution assessing the quality of our results
Since quality assessment of IBD segment calls of a large cohort is a less-studied problem, we developed a strategy for evaluating IBD segment calls in an “unsupervised” fashion: First, we compared the kinship coefficients derived from RaPID’s IBD calls against a standard genotype-based relatedness caller, KING [11]
Summary
The genealogical histories of individuals within populations are of interest to studies aiming both to uncover detailed pedigree information and overall quantitative population demographic histories. Complete pedigree information for most individuals is difficult to track beyond a few generations, it is possible to describe a person’s genealogical history using their genetic relatives revealed by identity by descent (IBD) segments—long genomic segments shared by two individuals within a population, which are identical due to inheritance from common ancestors. Thanks to the establishment of modern biobanks and direct-to-consumer (DTC) genetic testing companies which contain genotype information of a relatively large fraction (0.1–5%) of a population, it is possible to identify dense genetic connections, as manifested by identical-by-descent (IBD) segments, among individuals. IBD segments are shared DNA segments between two individuals that are inherited from a common ancestor This opens the possibility to study the quantitative details of one’s genealogical history and identify emerging patterns
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.