Abstract

BackgroundMassively parallel cDNA sequencing (RNA-seq) experiments are gradually superseding microarrays in quantitative gene expression profiling. However, many biologists are uncertain about the choice of differentially expressed gene (DEG) analysis methods and the validity of cost-saving sample pooling strategies for their RNA-seq experiments. Hence, we performed experimental validation of DEGs identified by Cuffdiff2, edgeR, DESeq2 and Two-stage Poisson Model (TSPM) in a RNA-seq experiment involving mice amygdalae micro-punches, using high-throughput qPCR on independent biological replicate samples. Moreover, we sequenced RNA-pools and compared their results with sequencing corresponding individual RNA samples.ResultsFalse-positivity rate of Cuffdiff2 and false-negativity rates of DESeq2 and TSPM were high. Among the four investigated DEG analysis methods, sensitivity and specificity of edgeR was relatively high. We documented the pooling bias and that the DEGs identified in pooled samples suffered low positive predictive values.ConclusionsOur results highlighted the need for combined use of more sensitive DEG analysis methods and high-throughput validation of identified DEGs in future RNA-seq experiments. They indicated limited utility of sample pooling strategies for RNA-seq in similar setups and supported increasing the number of biological replicate samples.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-015-1767-y) contains supplementary material, which is available to authorized users.

Highlights

  • Parallel cDNA sequencing (RNA-seq) experiments are gradually superseding microarrays in quantitative gene expression profiling

  • Cuffdiff2 identified more than half (51.67 %) of the true-positive differentially expressed gene (DEG), but contributed 87 % of the false positive DEGs. edgeR displayed the best sensitivity (76.67 %) and overall agreement with a false positivity rate of 9 %

  • Spearman correlation coefficients between the logarithmic fold changes (LFC), estimated by quantitative reverse-transcription PCR (qPCR), and those estimated by edgeR, Cuffdiff2, DESeq2 and Two-stage Poisson Model (TSPM) were 0.541, 0.524, 0.453 and 0.511, respectively (p < 0.001) [Additional file 6]

Read more

Summary

Introduction

Parallel cDNA sequencing (RNA-seq) experiments are gradually superseding microarrays in quantitative gene expression profiling. Many biologists are uncertain about the choice of differentially expressed gene (DEG) analysis methods and the validity of cost-saving sample pooling strategies for their RNA-seq experiments. RNA-seq experiments, often pose questions on sample size requirements, cost-effective strategies for sample pooling, and on the choice of data analysis software. Current literature on this topic [4,5,6,7,8,9,10] do not provide unequivocal answers to these important questions [1]

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call