Abstract
Despite the fundamental importance of single nucleotide polymorphisms (SNPs) to human evolution, there are still large gaps in our understanding of the forces that shape their distribution across the genome. SNPs have been shown to not be distributed evenly, with directly adjacent SNPs found unusually frequently. Why this is the case is unclear. We illustrate how neighboring SNPs that cannot be explained by a single mutation event (that we term here sequential dinucleotide mutations [SDMs]) are driven by distinct processes to SNPs and multinucleotide polymorphisms (MNPs). By studying variation across populations, including a novel cohort of 1,358 Scottish genomes, we show that, SDMs are over twice as common as MNPs and like SNPs display distinct mutational spectra across populations. These biases are not only different to those observed among SNPs and MNPs but are also more divergent between human population groups. We show that the changes that make up SDMs are not independent and identify a distinct mutational profile, CA → CG → TG, that is observed an order of magnitude more often than expected from background SNP rates and the numbers of other SDMs involving the gain and deamination of CpG sites. Intriguingly particular pathways through the amino acid code appear to have been favored relative to that expected from intergenic SDM rates and the occurrences of coding SNPs, and in particular those that lead to the creation of single codon amino acids. We finally present evidence that epistatic selection has potentially disfavored sequential nonsynonymous changes in the human genome.
Highlights
Single nucleotide polymorphisms (SNPs) are the pillars of modern genetics studies (Gray et al, 2000; Brookes, 1999)
Evidence for epistatic selection at neighbouring coding polymorphisms We investigated whether there is evidence of epistatic selection acting across the polymorphisms that make up coding sequential dinucleotide mutations (SDMs) i.e. whether the selective pressure acting on a nucleotide change depends on neighbouring changes
The changes that make up SDMs have been most often categorised as individual SNPs, we have shown they appear to be driven by distinct mutational processes
Summary
Single nucleotide polymorphisms (SNPs) are the pillars of modern genetics studies (Gray et al, 2000; Brookes, 1999). Many neighbouring polymorphisms occur at different allele frequencies in the population suggesting they have arisen from two distinct mutations (Hodgkinson and Eyre-Walker, 2010), which we term here sequential dinucleotide mutations (SDMs). These clustered changes have been comparatively understudied and may reflect mutational hotspots but may be the result of selection, with the impact of an initial deleterious SNP being at least partly corrected by a second neighbouring change (Davis et al, 2009). We characterise the functional importance of these SDMs by studying the selective pressures acting upon them and whether they have favoured particular pathways through the amino acid code
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.