Abstract

Different genomic sites evolve inter-dependently due to the combined action of epistasis, defined as a non-multiplicative contribution of alleles at different loci to genome fitness, and the physical linkage of different loci in genome. Both epistasis and linkage, partially compensated by recombination, cause correlations between allele frequencies at the loci (linkage disequilibrium, LD). The interaction and competition between epistasis and linkage are not fully understood, nor is their relative sensitivity to recombination. Modeling an adapting population in the presence of random mutation, natural selection, pairwise epistasis, and random genetic drift, we compare the contributions of epistasis and linkage. For this end, we use a panel of haplotype-based measures of LD and their various combinations calculated for epistatic and non-epistatic pairs separately. We compute the optimal percentages of detected and false positive pairs in a one-time sample of a population of moderate size. We demonstrate that true interacting pairs can be told apart in a sufficiently short genome within a narrow window of time and parameters. Outside of this parameter region, unless the population is extremely large, shared ancestry of individual sequences generates pervasive stochastic LD for non-interacting pairs masking true epistatic associations. In the presence of sufficiently strong recombination, linkage effects decrease faster than those of epistasis, and the detection of epistasis improves. We demonstrate that the epistasis component of locus association can be isolated, at a single time point, by averaging haplotype frequencies over multiple independent populations. These results demonstrate the existence of fundamental restrictions on the protocols for detecting true interactions in DNA sequence sets.

Highlights

  • Epistasis is inter-dependence of the fitness effects of mutations occurring at different loci

  • Evolution of the population between discrete generations is simulated using a Wright-Fisher model including the evolutionary factors of random mutation with the rate μ per site, random genetic drift, and natural selection, as described in Methods

  • We assume that epistatic pairs are isolated, i.e., that each genomic site interacts with only one site

Read more

Summary

Introduction

Epistasis is inter-dependence of the fitness effects of mutations occurring at different loci. The term ‘epistasis’ in population genetics refers to the fact that mutations occurring at different genomic sites affect the Darwinian fitness of an organism, i.e., its average progeny number, in a non-multiplicative fashion. Epistasis is a widespread property of biological networks [2, 5,6,7,8] and a subject of intense studies. The vital role it plays in the genetic evolution of populations and the heritability of complex traits is well established. Epistasis defines the evolutionary paths and creates fitness valleys, i.e., intermediate genetic variants with reduced fitness [10,11,12]

Objectives
Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.