Abstract

The completion of the rice (Oryza sativa) genome draft has brought unprecedented opportunities for genomic studies of the world's most important food crop. Previous rice gene annotations have relied mainly on ab initio methods, which usually yield a high rate of false-positive predictions and give only limited information regarding alternative splicing in rice genes. Comparative approaches based on expressed sequence tags (ESTs) can compensate for the drawbacks of ab initio methods because they can simultaneously identify experimental data-supported genes and alternatively spliced transcripts. Furthermore, cross-species EST information can be used to not only offset the insufficiency of same-species ESTs but also derive evolutionary implications. In this study, we used ESTs from seven plant species, rice, wheat (Triticum aestivum), maize (Zea mays), barley (Hordeum vulgare), sorghum (Sorghum bicolor), soybean (Glycine max), and Arabidopsis (Arabidopsis thaliana), to annotate the rice genome. We developed a plant genome annotation pipeline, Plant Gene and Alternatively Spliced Variant Annotator (PGAA). Using this approach, we identified 852 genes (931 isoforms) not annotated in other widely used databases (i.e. the Institute for Genomic Research, National Center for Biotechnology Information, and Rice Annotation Project) and found 87% of them supported by both rice and nonrice EST evidence. PGAA also identified more than 44,000 alternatively spliced events, of which approximately 20% are not observed in the other three annotations. These novel annotations represent rich opportunities for rice genome research, because the functions of most of our annotated genes are currently unknown. Also, in the PGAA annotation, the isoforms with non-rice-EST-supported exons are significantly enriched in transporter activity but significantly underrepresented in transcription regulator activity. We have also identified potential lineage-specific and conserved isoforms, which are important markers in evolutionary studies. The data and the Web-based interface, RiceViewer, are available for public access at http://RiceViewer.genomics.sinica.edu.tw/.

Highlights

  • The completion of the rice (Oryza sativa) genome draft has brought unprecedented opportunities for genomic studies of the world’s most important food crop

  • Because other crop plants and model organisms, such as maize (Zea mays), wheat (Triticum aestivum), and Arabidopsis (Arabidopsis thaliana) have been widely studied, inclusion of expressed sequence tags (ESTs) from these plant species may lead to the identification of rice genes/AS isoforms that have not previously been annotated

  • Plant Gene and Alternatively Spliced Variant Annotator (PGAA) is a comparative method that first identifies AS variants and genes by use of the same-species-EST-to-genome comparison and curates the results with cross-species EST data conserved in the annotated genome

Read more

Summary

Introduction

The completion of the rice (Oryza sativa) genome draft has brought unprecedented opportunities for genomic studies of the world’s most important food crop. We developed a plant genome annotation pipeline, Plant Gene and Alternatively Spliced Variant Annotator (PGAA) Using this approach, we identified 852 genes (931 isoforms) not annotated in other widely used databases (i.e. the Institute for Genomic Research, National Center for Biotechnology Information, and Rice Annotation Project) and found 87% of them supported by both rice and nonrice EST evidence. The Gramene Web site (http:// www.gramene.org/Oryza_sativa) demonstrates that cross-species EST mapping to the rice genome may benefit evolutionary studies of the grass family, and a recent study comparing AS events between rice and Arabidopsis (Wang and Brendel, 2006) showed that important findings could be revealed with crossspecies EST comparisons. PGAA is a comparative method that first identifies AS variants and genes by use of the same-species-EST-to-genome comparison and curates the results with cross-species EST data conserved in the annotated genome. To facilitate comparison of the annotation results and EST conservation, we have developed a Web-based visualization tool, RiceViewer, which is readily accessible to the public

Methods
Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.