Abstract

Multiparental populations (MPPs) are experimental populations in which the genome of every individual is a mosaic of known founder haplotypes. These populations are useful for detecting quantitative trait loci (QTL) because tests of association can leverage inferred founder haplotype descent. It is difficult, however, to determine how haplotypes at a locus group into distinct functional alleles, termed the allelic series. The allelic series is important because it provides information about the number of causal variants at a QTL and their combined effects. In this study, we introduce a fully Bayesian model selection framework for inferring the allelic series. This framework accounts for sources of uncertainty found in typical MPPs, including the number and composition of functional alleles. Our prior distribution for the allelic series is based on the Chinese restaurant process, a relative of the Dirichlet process, and we leverage its connection to the coalescent to introduce additional prior information about haplotype relatedness via a phylogenetic tree. We evaluate our approach via simulation and apply it to QTL from two MPPs: the Collaborative Cross (CC) and the Drosophila Synthetic Population Resource (DSPR). We find that, although posterior inference of the exact allelic series is often uncertain, we are able to distinguish biallelic QTL from more complex multiallelic cases. Additionally, our allele-based approach improves haplotype effect estimation when the true number of functional alleles is small. Our method, Tree-Based Inference of Multiallelism via Bayesian Regression (TIMBR), provides new insight into the genetic architecture of QTL in MPPs.

Highlights

  • Multiparental populations (MPPs) are experimental populations in which the genome of every individual is a mosaic of known founder haplotypes

  • A linear model for the additive effect of J founder haplotypes at a genomic locus on a quantitative trait is given by yi 1⁄4 x1ibhap;1 þ x2ibhap;2 þ . . . þ xJibhap; J þ ei ; where yi is the quantitative trait measurement of individual i, xji is the number of copies of the founder j haplotype, with xji 2 f0; 1; 2g if haplotypes are known and xji 2 1⁄20; 2Š if counts are estimated as imputed dosages, bhap,j is the additive effect of the founder j haplotype, and ei is normally distributed error

  • The accuracy of allelic series inference depends on three factors: (1) the true number of functional alleles, (2) the quantitative trait locus (QTL) effect size, and (3) the level of prior information about the phylogenetic tree

Read more

Summary

Introduction

Multiparental populations (MPPs) are experimental populations in which the genome of every individual is a mosaic of known founder haplotypes. Whereas observed variant approaches such as single nucleotide polymorphism (SNP) association typically assume biallelic effects, haplotype-based association (technically a type of linkage disequilibrium mapping) tests the combined effects of all variants within the genomic interval, including any local epistastic interactions or variants that are unobserved or undiscovered (Zhang et al 2014) This permits detection of complex genetic signals that may not be revealed by single-variant approaches, an advantage that has contributed to the widespread development of MPPs across a variety of biomedically (Churchill et al 2004; Collaborative Cross Consortium 2012; Macdonald and Long 2007; King et al 2012; Kover et al 2009) and agriculturally (Huang et al 2015) important model organisms and species. We expect that sets of shared causal variants partition the haplotypes into a potentially smaller number of functionally distinct alleles, with this assignment of haplotypes to functional alleles termed the allelic series

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call