Abstract

BackgroundCrop plants such as rice, maize and sorghum play economically-important roles as main sources of food, fuel, and animal feed. However, current genome annotations of crop plants still suffer false-positive predictions; a more comprehensive registry of alternative splicing (AS) events is also in demand. Comparative genomics of crop plants is largely unexplored.ResultsWe performed a large-scale comparative analysis (ExonFinder) of the expressed sequence tag (EST) library from nine grass plants against three crop genomes (rice, maize, and sorghum) and identified 2,879 previously-unannotated exons (i.e., novel exons) in the three crops. We validated 81% of the tested exons by RT-PCR-sequencing, supporting the effectiveness of our in silico strategy. Evolutionary analysis reveals that the novel exons, comparing with their flanking annotated ones, are generally under weaker selection pressure at the protein level, but under stronger pressure at the RNA level, suggesting that most of the novel exons also represent novel alternatively spliced variants (ASVs). However, we also observed the consistency of evolutionary rates between certain novel exons and their flanking exons, which provided further evidence of their co-occurrence in the transcripts, suggesting that previously-annotated isoforms might be subject to erroneous predictions. Our validation showed that 54% of the tested genes expressed the newly-identified isoforms that contained the novel exons, rather than the previously-annotated isoforms that excluded them. The consistent results were steadily observed across cultivated (Oryza sativa and O. glaberrima) and wild (O. rufipogon and O. nivara) rice species, asserting the necessity of our curation of the crop genome annotations. Our comparative analyses also inferred the common ancestral transcriptome of grass plants and gain- and loss-of-ASV events.ConclusionsWe have reannotated the rice, maize, and sorghum genomes, and showed that evolutionary rates might serve as an indicator for determining whether the identified exons were alternatively spliced. This study not only presents an effective in silico strategy for the improvement of plant annotations, but also provides further insights into the role of AS events in the evolution and domestication of crop plants. ExonFinder and the novel exons/ASVs identified are publicly accessible at http://exonfinder.sourceforge.net/.Electronic supplementary materialThe online version of this article (doi:10.1186/s12870-015-0431-7) contains supplementary material, which is available to authorized users.

Highlights

  • Crop plants such as rice, maize and sorghum play economically-important roles as main sources of food, fuel, and animal feed

  • Identification of novel exons in rice, maize, and sorghum We introduced an in silico pipeline, ExonFinder, to identify previously unannotated exons/Alternatively spliced variant (ASV) in target species by comparative analysis of the expressed sequence tag (EST) library of non-target species against the genome of target species (Table 1 and Figure 1A)

  • To eliminate false positives from accidental matches, we only considered EST matches that satisfied the following criteria: (1) a proper exon and its flanking exons must overlap with the same Ensemblannotated transcript; (2) a proper cassette exon must be flanked by canonical splicing sites at its both ends; and (3)

Read more

Summary

Introduction

Crop plants such as rice, maize and sorghum play economically-important roles as main sources of food, fuel, and animal feed. Current genome annotations of crop plants still suffer false-positive predictions; a more comprehensive registry of alternative splicing (AS) events is in demand. It remains challenging to determine whether an ASV is functionally important [30,31,32,33], not to mention that AS is less characterized in plants than in mammals, and that most plant ASVs have unknown functional consequences [10], and that some of computationally-annotated genes/transcripts are subject to erroneous prediction. Much effort to annotate plant transcripts produces several prominent databases [34,35,36,37,38,39], there still lacks an effective strategy to make use of public resources (e.g., EST traces) for better annotation of ASVs and accurate identification of novel isoforms in plant genomes

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call