The question of how to estimate a founder's proportion of a single descendant's genome has renewed relevance for outbred pedigrees, given the abundant DNA sequence data for model and nonmodel eukaryotes alike. Here we show that a donor-recurrent shortcut method can provide a robust estimation of founder proportions. In addition, we define the theoretical variance and estimate confidence intervals using a nonparameteric bootstrap method. Using actual marker data from a highly heterozygous outbred Pinus taeda pedigree, it was found that each founder's genomic proportion varied widely for each descendant, ranging from 8.8% to 38.7%. In 1 case, skewed transmission of a founder's genome could be statistically detected. Its founder proportions ranged from 1.54% to 48.46%, and its mean value was 17.59%, well below the expected value of 25%. Two-thirds of its 91 descendants had 1 or the other founder haplotype, despite 2 successive meioses. The donor-recurrent method was robust; variation for estimated founder's proportions was also wide for simulated high-density datasets whether markers were dispersed or clustered. Estimating founder contributions using this computational shortcut has broad application for highly heterozygous outbred pedigrees characterized by large sibships, low population differentiation, and shallow physical mapping resources. The relevance of this computational shortcut for outbred populations used for conservation, domestication, and evolutionary biology research is discussed.
Read full abstract