Abstract

Despite the fundamental importance of single nucleotide polymorphisms (SNPs) to human evolution, there are still large gaps in our understanding of the forces that shape their distribution across the genome. SNPs have been shown to not be distributed evenly, with directly adjacent SNPs found unusually frequently. Why this is the case is unclear. We illustrate how neighboring SNPs that cannot be explained by a single mutation event (that we term here sequential dinucleotide mutations [SDMs]) are driven by distinct processes to SNPs and multinucleotide polymorphisms (MNPs). By studying variation across populations, including a novel cohort of 1,358 Scottish genomes, we show that, SDMs are over twice as common as MNPs and like SNPs display distinct mutational spectra across populations. These biases are not only different to those observed among SNPs and MNPs but are also more divergent between human population groups. We show that the changes that make up SDMs are not independent and identify a distinct mutational profile, CA → CG → TG, that is observed an order of magnitude more often than expected from background SNP rates and the numbers of other SDMs involving the gain and deamination of CpG sites. Intriguingly particular pathways through the amino acid code appear to have been favored relative to that expected from intergenic SDM rates and the occurrences of coding SNPs, and in particular those that lead to the creation of single codon amino acids. We finally present evidence that epistatic selection has potentially disfavored sequential nonsynonymous changes in the human genome.

Highlights

  • Single nucleotide polymorphisms (SNPs) are the pillars of modern genetics studies (Gray et al, 2000; Brookes, 1999)

  • Evidence for epistatic selection at neighbouring coding polymorphisms We investigated whether there is evidence of epistatic selection acting across the polymorphisms that make up coding sequential dinucleotide mutations (SDMs) i.e. whether the selective pressure acting on a nucleotide change depends on neighbouring changes

  • The changes that make up SDMs have been most often categorised as individual SNPs, we have shown they appear to be driven by distinct mutational processes

Read more

Summary

Introduction

Single nucleotide polymorphisms (SNPs) are the pillars of modern genetics studies (Gray et al, 2000; Brookes, 1999). Many neighbouring polymorphisms occur at different allele frequencies in the population suggesting they have arisen from two distinct mutations (Hodgkinson and Eyre-Walker, 2010), which we term here sequential dinucleotide mutations (SDMs). These clustered changes have been comparatively understudied and may reflect mutational hotspots but may be the result of selection, with the impact of an initial deleterious SNP being at least partly corrected by a second neighbouring change (Davis et al, 2009). We characterise the functional importance of these SDMs by studying the selective pressures acting upon them and whether they have favoured particular pathways through the amino acid code

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call