Abstract

BackgroundRecently, genomic sequencing efforts were finished for Oryza sativa (cultivated rice) and Arabidopsis thaliana (Arabidopsis). Additionally, these two plant species have extensive cDNA and expressed sequence tag (EST) libraries. We employed the Program to Assemble Spliced Alignments (PASA) to identify and analyze alternatively spliced isoforms in both species.ResultsA comprehensive analysis of alternative splicing was performed in rice that started with >1.1 million publicly available spliced ESTs and over 30,000 full length cDNAs in conjunction with the newly enhanced PASA software. A parallel analysis was performed with Arabidopsis to compare and ascertain potential differences between monocots and dicots. Alternative splicing is a widespread phenomenon (observed in greater than 30% of the loci with transcript support) and we have described nine alternative splicing variations. While alternative splicing has the potential to create many RNA isoforms from a single locus, the majority of loci generate only two or three isoforms and transcript support indicates that these isoforms are generally not rare events. For the alternate donor (AD) and acceptor (AA) classes, the distance between the splice sites for the majority of events was found to be less than 50 basepairs (bp). In both species, the most frequent distance between AA is 3 bp, consistent with reports in mammalian systems. Conversely, the most frequent distance between AD is 4 bp in both plant species, as previously observed in mouse. Most alternative splicing variations are localized to the protein coding sequence and are predicted to significantly alter the coding sequence.ConclusionAlternative splicing is widespread in both rice and Arabidopsis and these species share many common features. Interestingly, alternative splicing may play a role beyond creating novel combinations of transcripts that expand the proteome. Many isoforms will presumably have negative consequences for protein structure and function, suggesting that their biological role involves post-transcriptional regulation of gene expression.

Highlights

  • IntroductionGenomic sequencing efforts were finished for Oryza sativa (cultivated rice) and Arabidopsis thaliana (Arabidopsis)

  • Genomic sequencing efforts were finished for Oryza sativa and Arabidopsis thaliana (Arabidopsis)

  • Program to Assemble Spliced Alignments (PASA)-generated transcript alignment assemblies for rice and Arabidopsis All expressed sequence tag (EST) and mRNA (mRNAs include full-length cDNAs (FL-cDNAs)) sequences were obtained from public data

Read more

Summary

Introduction

Genomic sequencing efforts were finished for Oryza sativa (cultivated rice) and Arabidopsis thaliana (Arabidopsis). These two plant species have extensive cDNA and expressed sequence tag (EST) libraries. We performed a comprehensive analysis of alternative splicing in rice, a model monocotyledonous species, using the latest set of rice transcript data and the Program to Assemble Spliced Alignments (PASA). The PASA software [5] assembles and clusters spliced transcript alignments, providing transcript-based gene structures that are used to automatically improve existing gene annotations by adding untranslated regions (UTRs), adjusting intron and exon boundaries, and adding new models that represent alternative splicing, among its numerous other functions. After assembling all overlapping alignments, those distinct PASA assemblies found overlapping at the same locus and transcribed in the same orientation yield representative splicing isoforms for that locus

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call