Abstract

BackgroundThe basic process of RNA splicing is conserved among eukaryotic species. Three signals (5' and 3' splice sites and branch site) are commonly used to directly conduct splicing, while other features are also related to the recognition of an intron. Although there is experimental evidence pointing to the significant species specificities in the features of intron recognition, a quantitative evaluation of the divergence of these features among a wide variety of eukaryotes has yet to be conducted.ResultsTo better understand the splicing process from the viewpoints of evolution and information theory, we collected introns from 61 diverse species of eukaryotes and analyzed the properties of the nucleotide sequences relevant to splicing. We found that trees individually constructed from the five features (the three signals, intron length, and nucleotide composition within an intron) roughly reflect the phylogenetic relationships among the species but sometimes extensively deviate from the species classification. The degree of topological deviation of each feature tree from the reference trees indicates the lowest discordance for the 5' splicing signal, followed by that for the 3' splicing signal, and a considerably greater discordance for the other three features. We also estimated the relative contributions of the five features to short intron recognition in each species. Again, moderate correlation was observed between the similarities in pattern of short intron recognition and the genealogical relationships among the species. When mammalian introns were categorized into three subtypes according to their terminal dinucleotide sequences, each subtype segregated into a nearly monophyletic group, regardless of the host species, with respect to the 5' and 3' splicing signals. It was also found that GC-AG introns are extraordinarily abundant in some species with high genomic G + C contents, and that the U12-type spliceosome might make a greater contribution than currently estimated in most species.ConclusionsOverall, the present study indicates that both splicing signals themselves and their relative contributions to short intron recognition are rather susceptible to evolutionary changes, while some poorly characterized properties seem to be preserved within the mammalian intron subtypes. Our findings may afford additional clues to understanding of evolution of splicing mechanisms.

Highlights

  • The basic process of RNA splicing is conserved among eukaryotic species

  • AT-AC introns are generally considered to be the U12 type [10]. To better characterize this minor type of introns, we investigated AT-AC introns in five species, H. sapiens, Mus musculus, D. melanogaster, A. thaliana, and O. sativa, whose genomic sequences are of high quality

  • We have characterized representative features involved in RNA splicing in divergent eukaryotic species from the viewpoints of evolution and information theory

Read more

Summary

Introduction

Three signals (5’ and 3’ splice sites and branch site) are commonly used to directly conduct splicing, while other features are related to the recognition of an intron. The intron is a nucleotide sequence that is transcribed from the genome but removed in the process of maturation of the transcribed RNA. Broadly distributed exonic and intronic splicing enhancers/silencers are involved in the correct recognition and regulation of a particular pair of 5’ss and 3’ss to be spliced. Besides these signal sequences that are directly involved in splicing, other features are related to intron recognition, including intron length and nucleotide composition within the intron [5]

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call