Abstract

Codon composition, GC content and local RNA secondary structures can have a profound effect on gene expression, and mutations affecting these parameters, even though they do not alter the protein sequence, are not neutral in terms of selection. Although evidence exists that, in some cases, selection favours more stable RNA secondary structures, we currently lack a concrete idea of how many genes are affected within a species, and whether this is a universal phenomenon in nature. We searched for signs of structural selection in a global manner, analysing a set of 1 million coding sequences from 73 species representing all domains of life, as well as viruses, by means of our newly developed software PACKEIS. We show that codon composition and amino acid identity are main determinants of RNA secondary structure. In addition, we show that the arrangement of synonymous codons within coding sequences is non-random, yielding extremely high, but also extremely low, RNA structuredness significantly more often than expected by chance. Taken together, we demonstrate that selection for high and low levels of secondary structure is a widespread phenomenon. Our results provide another line of evidence that synonymous mutations are less neutral than commonly thought, which is of importance for many evolutionary models.

Highlights

  • IntroductionThe genetic code of DNA uses units of three nucleotides (codons) to code for one amino acid

  • The genetic code of DNA uses units of three nucleotides to code for one amino acid

  • We have developed the highly parallelizable software PACKEIS, which compares the degree of backfolding (DBF) of the oORFs with that of a defined number of alternative ORFs (aORFs), yielding a DBF score ranging from 0 to 1; this score refers to the DBF in the light of alternative codon usage, with 0 representing extremely low structuredness and 1 representing extremely high structuredness

Read more

Summary

Introduction

The genetic code of DNA uses units of three nucleotides (codons) to code for one amino acid. Mutations at the DNA level do not necessarily result in an altered amino acid sequence of the corresponding protein. These silent (synonymous) substitutions have long been assumed to be neutral in terms of natural selection [1]. Silent substitutions will necessarily result in altered codon composition of a gene and further have the potential to alter a gene’s GC content, both of which are features that can be subject to selection [2]. Silent substitutions can change the secondary structure of an mRNA, thereby affecting the process of translation [3,4,5,6], and non-random patterns of secondary structures within protein coding genes in different species have been explained by natural selection [7 –10]

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call