Abstract

Estimating the extent of genetic differentiation between populations is an important measure in population genetics, ecology and evolutionary biology. The fixation index, or FST, is an important measure, which is routinely used to quantify this. Previous studies have shown that the FST estimated for selectively constrained regions was significantly lower than that estimated for neutral regions. By deriving the theoretical relationship between FST at neutral and constrained sites, we show that excess in the fraction of deleterious variations segregating within populations compared to those segregating between populations is the cause for the reduction in FST estimated at constrained sites. Using whole-genome data, our results revealed that the magnitude of reduction in FST estimates obtained for selectively constrained regions was much higher for distantly related populations compared to those estimated for closely related pairs. For example, the reduction was 47% for comparison between Europeans and Africans, 30% for the European and Asian comparison, 16% for the Northern and Southern European pair, and only 4% for the comparison involving two Southern European (Italian and Spanish) populations. Since deleterious variants are purged over time due to purifying selection, their contribution to the among-population diversity at constrained sites decreases with the increase in the divergence between populations. However, within-population diversities remain the same for all pairs compared; therefore, the FST estimated at constrained sites for distantly related populations are much smaller than those estimated for closely related populations. We obtained similar results when only the SNPs with similar allele frequencies at neutral and constrained sites were used. Our results suggest that the level of population divergence should be considered when comparing constrained site FST estimates from different pairs of populations.

Highlights

  • Since the introduction of F-statistics by Sewall Wright [1], the fixation index, or FST, has been routinely used to measure the extent of differentiation between populations [2,3,4,5,6,7,8,9,10,11,12]

  • Nonsynonymous single nucleotide polymorphisms (SNPs) were grouped into seven categories based on their C-scores

  • We show that the FST estimated for nonsynonymous SNPs (nSNPs) is expected to be smaller than that of synonymous SNPs (sSNPs) as the fraction of neutral + deleterious SNPs segregating within populations is higher than those segregating between populations (Equation (7))

Read more

Summary

Introduction

Since the introduction of F-statistics by Sewall Wright [1], the fixation index, or FST, has been routinely used to measure the extent of differentiation between populations [2,3,4,5,6,7,8,9,10,11,12]. A number of methods have been developed to measure FST using genetic data, such as by [13] Nei (1973), [14] Weir and Cockerham (1984), and [15] Hudson (1992), which were based on Wright’s F-statistics. An alternative method based on genetic distances was developed to measure population differentiation in gene, gametic, and genotypic frequency data [16]. A novel method based on allele frequency difference (AFD) was developed to measure population differentiation [20]. Despite the availability of many methods, the first three methods mentioned above are widely used in population genetics and evolutionary biology

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call