Transcription-related mutations and GC content drive variation in nucleotide substitution rates across the genomes of Arabidopsis thaliana and Arabidopsis lyrata

Leah J Derose-Wilson, Brandon S Gaut

doi:10.1186/1471-2148-7-66

Open Access

Journal: BMC Evolutionary Biology
Publication Date: Jan 1, 2007
Citations: 95
License type: cc-by

Affiliation: University of California, Irvine

Abstract

Background: There has been remarkably little study of nucleotide substitution rate variation among plant nuclear genes, in part because orthology is difficult to establish. Orthology is even more problematic for intergenic regions of plant nuclear genomes, because plant genomes generally harbor a wealth of repetitive DNA. In theory, orthologous intergenic data are valuable for studying rate variation because nucleotide substitutions in these regions should be under little selective constraint compared to coding regions. As a result, evolutionary rates in intergenic regions may more accurately reflect genomic features, like recombination and GC content, that contribute to nucleotide substitution.

Results: We generated a set of 66 intergenic sequences in Arabidopsis lyrata, a close relative of Arabidopsis thaliana. The intergenic regions included transposable element (TE) remnants and regions flanking the TEs. We verified orthology of these amplified regions both by comparison of existing A. lyrata – A. thaliana genetic maps and by using molecular features. We compared substitution rates among the 66 intergenic loci, which exhibit ~5-fold rate variation, and compared intergenic rates to a set of 64 orthologous coding sequences. Our chief observations were that the average rate of nucleotide substitution is slower in intergenic regions than in synonymous sites, that rate variation in both intergenic and coding regions correlates with GC content, that GC content alone is not sufficient to explain differences in rates between intergenic and coding regions, and that rates of evolution in intergenic regions correlate negatively with gene density.

Conclusion: Our observations indicated that mutation rates vary among genomic regions as a function of base composition, suggesting that previous observations of "selective constraint" on non-coding regions could more accurately be attributed to a GC effect instead of selection. The negative correlation between nucleotide substitution rate and gene density provides a potential neutral explanation for a previously documented correlation between gene density and polymorphism levels within A. thaliana. Finally, we discuss potential forces that could contribute to rapid synonymous rates, and provide evidence to suggest that transcription-related mutation contributes to rate differences between intergenic and synonymous sites.

Highlights

There has been remarkably little study of nucleotide substitution rate variation among plant nuclear genes, in part because orthology is difficult to establish.
The intergenic data are contrasted to a second data set consisting of large (>400 bp) exonic sequences from A. lyrata and A. thaliana. With these two data sets, we address several questions about Arabidopsis nucleotide substitution rates, such as: i) do intergenic sequences evolve at rates similar to synonymous sites in coding data? ii) do any genomic features, like GC content or recombination, correlate with nucleotide substitution rate variation among loci? iii) what can be inferred about the relative contribution of mutation and selection to nucleotide substitution? and iv) do intergenic regions provide any hints to the mechanisms that contribute to genome size differences between A. lyrata and A. thaliana?
It is clear that GC content is a major determinant of evolutionary rate variation, between sequence types and among loci.

Summary

Introduction

There has been remarkably little study of nucleotide substitution rate variation among plant nuclear genes, in part because orthology is difficult to establish. In theory, orthologous intergenic data are valuable for studying rate variation because nucleotide substitutions in these regions should be under little selective constraint compared to coding regions. Evolutionary rates in intergenic regions may therefore more accurately reflect genomic features, like recombination and GC content, that contribute to nucleotide substitution. The primary processes that contribute to nucleotide substitution rates are mutation, selection, and population history, but their relative contributions can vary substantially among genes and genomic regions. Our understanding of the evolutionary forces that contribute to nucleotide substitution rates has been based primarily on the study of coding regions. The important point is that it can be difficult to disentangle the contributions of selection and mutation to rate variation among coding regions.
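
The quantities discussed above (per-locus divergence, GC content, and their correlation among loci) can be made concrete with a small sketch. This is only an illustration, not the authors' pipeline: the text here does not say how substitution rates were estimated, so the Jukes-Cantor correction, the function names, and the toy sequences below are all assumptions introduced for the example.

```python
import math


def gc_content(seq):
    """Fraction of G or C among the unambiguous bases (A, C, G, T) of a sequence."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence has no unambiguous bases")
    return sum(b in "GC" for b in bases) / len(bases)


def jc69_distance(seq_a, seq_b):
    """Jukes-Cantor corrected substitutions per site for two aligned sequences.

    Assumption: Jukes-Cantor is used purely for illustration. Alignment
    columns containing gaps or ambiguous bases are skipped.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [(x, y) for x, y in zip(seq_a.upper(), seq_b.upper())
             if x in "ACGT" and y in "ACGT"]
    if not pairs:
        raise ValueError("no comparable sites")
    p = sum(x != y for x, y in pairs) / len(pairs)  # observed proportion of differences
    if p >= 0.75:
        raise ValueError("sequences too divergent for the Jukes-Cantor correction")
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length lists of numbers."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical toy alignments, one pair per locus (not data from the paper).
loci = [
    ("ATATATATGCAT", "ATATATATGCAT"),
    ("GCGCATGCGCAT", "GCGTATGCGCAT"),
    ("GCGCGCGCGCAT", "GCACGTGCGCAT"),
]
gc_values = [gc_content(a) for a, _ in loci]
distances = [jc69_distance(a, b) for a, b in loci]
r = pearson_r(gc_values, distances)
```

With real data, each pair would be an orthologous A. thaliana and A. lyrata alignment for one locus, and `r` would summarize how divergence covaries with GC content across loci. A real analysis would also need a substitution model suited to the data and a significance test for the correlation.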
