Abstract

The emergence of NextGen sequencing technology has generated much interest in the exploration of transcriptomes. Currently, Illumina Inc. (San Diego, CA) provides one of the most widely utilized sequencing platforms for gene expression analysis. While Illumina reagents and protocols perform adequately in RNA-sequencing (RNA-seq), alternative reagents and protocols promise a higher throughput at a much lower cost. We have developed a low-cost and robust protocol to produce Illumina-compatible (GAIIx and HiSeq2000 platforms) RNA-seq libraries by combining several recent improvements. First, we designed balanced adapter sequences for multiplexing of samples; second, dUTP incorporation in 2nd strand synthesis was used to enforce strand-specificity; third, we simplified RNA purification, fragmentation and library size-selection steps thus drastically reducing the time and increasing throughput of library construction; fourth, we included an RNA spike-in control for validation and normalization purposes. To streamline informatics analysis for the community, we established a pipeline within the iPlant Collaborative. These scripts are easily customized to meet specific research needs and improve on existing informatics and statistical treatments of RNA-seq data. In particular, we apply significance tests for determining differential gene expression and intron retention events. To demonstrate the potential of both the library-construction protocol and data-analysis pipeline, we characterized the transcriptome of the rice leaf. Our data supports novel gene models and can be used to improve current rice genome annotation. Additionally, using the rice transcriptome data, we compared different methods of calculating gene expression and discuss the advantages of a strand-specific approach to detect bona-fide anti-sense transcripts and to detect intron retention events. Our results demonstrate the potential of this low cost and robust method for RNA-seq library construction and data analysis.

Highlights

  • The advent of ultra-high-throughput sequencing (UHTS) technology has invoked a paradigm shift in the field of genomics and transcriptomics [1,2]

  • It is possible to obtain whole-genome scale information at a highly accelerated rate. This advancement in sequencing technology has led to new opportunities to explore global genomic and transcriptomic landscapes; such studies include wholegenome de novo/re-sequencing [3,4], bisulfite-sequencing [5,6], chromatin immuno-precipitation-sequencing (Chip-seq) [7,8], and RNA sequencing (RNA-seq) [9,10]

  • We incorporated a step that preserves the strandspecific nature of mRNA molecules

Read more

Summary

Introduction

The advent of ultra-high-throughput sequencing (UHTS) technology has invoked a paradigm shift in the field of genomics and transcriptomics [1,2]. No standard has been established for methods of processing RNA-seq data for gene expression estimation, normalization, comparison and experimental design. We describe a low-cost and robust method of generating strand-specific Illumina-compatible libraries for RNAseq and a data analysis pipeline to improve gene quantification and detection.

Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call