Abstract

Detection and quantification of circular RNAs (circRNAs) face several significant challenges, including high false discovery rate, uneven rRNA depletion and RNase R treatment efficiency, and underestimation of back-spliced junction reads. Here, we propose a novel algorithm, CIRIquant, for accurate circRNA quantification and differential expression analysis. By constructing pseudo-circular reference for re-alignment of RNA-seq reads and employing sophisticated statistical models to correct RNase R treatment biases, CIRIquant can provide more accurate expression values for circRNAs with significantly reduced false discovery rate. We further develop a one-stop differential expression analysis pipeline implementing two independent measures, which helps unveil the regulation of competitive splicing between circRNAs and their linear counterparts. We apply CIRIquant to RNA-seq datasets of hepatocellular carcinoma, and characterize two important groups of linear-circular switching and circular transcript usage switching events, which demonstrate the promising ability to explore extensive transcriptomic changes in liver tumorigenesis.

Highlights

  • Detection and quantification of circular RNAs face several significant challenges, including high false discovery rate, uneven ribosomal RNA (rRNA) depletion and RNase R treatment efficiency, and underestimation of back-spliced junction reads

  • To rigorously evaluate the challenges in current quantification of circRNA expression, we collected 63 transcriptomic samples from six previous studies[16,17,18,19,20,21,22], including both RiboMinus and RiboMinus/RNase R RNA-seq libraries of four species. All these RNA-seq datasets were aligned to their reference genomes and the ribosomal RNA sequences using HISAT223 to assess the mapping rate and rRNA sequence fraction

  • Several computational methods have been developed for circRNA detection, the reliability of the output is a crucial factor for further analysis

Read more

Summary

Introduction

Detection and quantification of circular RNAs (circRNAs) face several significant challenges, including high false discovery rate, uneven rRNA depletion and RNase R treatment efficiency, and underestimation of back-spliced junction reads. By constructing pseudo-circular reference for re-alignment of RNA-seq reads and employing sophisticated statistical models to correct RNase R treatment biases, CIRIquant can provide more accurate expression values for circRNAs with significantly reduced false discovery rate. A modelbased strategy is employed by Sailfish-cir[10], which uses a quasimapping method to acquire direct estimation of circular transcript expression. This statistical model depends on the unique sequence between circular and linear transcripts, which limits its ability on quantifying exonic circRNAs. the ratio of BSJ reads to canonical linear reads at the junction, which represents the splicing preference in precursor mRNAs, is an important factor in circRNA analysis. A reliable computational tool is urgently needed for accurate quantification of circRNAs and their parental linear transcripts

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call