QuickMIRSeq: a pipeline for quick and accurate quantification of both known miRNAs and isomiRs by jointly processing multiple samples from microRNA sequencing

Shanrong Zhao,Wen He,Li Xi,Baohong Zhang,David Von Schack,Michael Agostino,Sarah Du,Sachin Mathur,William Gordon,Michael Vincent,Theresa Paradis,Chi Zhang

doi:10.1186/s12859-017-1601-4

Shanrong Zhao, Wen He + Show 10 more

Open Access

https://doi.org/10.1186/s12859-017-1601-4

Copy DOI

Journal: BMC Bioinformatics	Publication Date: Mar 20, 2017
Citations: 44	License type: open-access

Affiliation: Pfizer (United States)

Abstract

BackgroundGenome-wide miRNA expression data can be used to study miRNA dysregulation comprehensively. Although many open-source tools for microRNA (miRNA)-seq data analyses are available, challenges remain in accurate miRNA quantification from large-scale miRNA-seq dataset. We implemented a pipeline called QuickMIRSeq for accurate quantification of known miRNAs and miRNA isoforms (isomiRs) from multiple samples simultaneously.ResultsQuickMIRSeq considers the unique nature of miRNAs and combines many important features into its implementation. First, it takes advantage of high redundancy of miRNA reads and introduces joint mapping of multiple samples to reduce computational time. Second, it incorporates the strand information in the alignment step for more accurate quantification. Third, reads potentially arising from background noise are filtered out to improve the reliability of miRNA detection. Fourth, sequences aligned to miRNAs with mismatches are remapped to a reference genome to further reduce false positives. Finally, QuickMIRSeq generates a rich set of QC metrics and publication-ready plots.ConclusionsThe rich visualization features implemented allow end users to interactively explore the results and gain more insights into miRNA-seq data analyses. The high degree of automation and interactivity in QuickMIRSeq leads to a substantial reduction in the time and effort required for miRNA-seq data analysis.

Highlights

Genome-wide miRNA expression data can be used to study miRNA dysregulation comprehensively
Recent additional studies have shown that MiRNA isoforms (isomiRs) sequences are tissue and gender-specific [34] and play distinct roles in biological processes [37], which emphasize the importance of performing miRNA-seq analysis simultaneously at both the miRNA and isomiR levels
The complete project reports can be downloaded from the QuickMIRSeq project home page

Summary

Results

QuickMIRSeq can analyze miRNA-seq datasets from any species as long as the corresponding mature miRNA and hairpin databases are available. When we analyzed an in-house cell-free miRNA-seq dataset from urine, we found some samples had exceptionally high redundancy in unaligned reads (unpublished data). The read length distributions for samples SRR1759212, SRR1759213, SRR1759214, and SRR1759215 from GSE64977 are shown in Additional file 1: Figure S7. The difference mainly results from the fact that miRge ignores strand information when analyzing miRNA datasets and that its execution workflow tends to exclude reads with mismatches from quantification, as discussed further in Additional file 1: Figure S9. Bcbio-nextgen implements a configurable bestpractices pipeline for small RNA-seq data analysis (https://bcbio-nextgen.readthedocs.io/en/latest/contents/ pipelines.html#smallrna-seq), including quality controls, adapter trimming, miRNA/isomiR quantification, other small RNA detection, and prediction of new miRNAs. The quantification of known small RNAs is carried out by SeqBuster [45], a bioinformatic tool developed in 2010, while the quantification isomiRs is done by R script. QuickMIRSeq makes all analysis results fully accessible via a web interface, and enables end users to visualize them interactively

Conclusions

Background

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

QuickMIRSeq: a pipeline for quick and accurate quantification of both known miRNAs and isomiRs by jointly processing multiple samples from microRNA sequencing

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

A hybrid retention time alignment algorithm for SWATH-MS data.
Long Wu ... Sabine Amon
Proteomics | VOL. 16
Long Wu, et. al.Long Wu ... Sabine Amon
01 Aug 2016
Proteomics | VOL. 16

MZDASoft: a software architecture that enables large-scale comparison of protein expression levels over multiple samples based on liquid chromatography/tandem mass spectrometry.
Mehrab Ghanat Bari ... Jianqiu (Michelle) Zhang
Rapid communications in mass spectrometry : RCM | VOL. 29
Mehrab Ghanat Bari, et. al.Mehrab Ghanat Bari ... Jianqiu (Michelle) Zhang
25 Aug 2015
Rapid communications in mass spectrometry : RCM | VOL. 29

Comprehensive assessment of multiple biases in small RNA sequencing reveals significant differences in the performance of widely used methods
Carrie Wright ... Emily E Burke
BMC Genomics | VOL. 20
Carrie Wright, et. al.Carrie Wright ... Emily E Burke
21 Jun 2019
BMC Genomics | VOL. 20

Advances in Proteome Analysis by Mass Spectrometry
Timothy J Griffin ... Ruedi Aebersold
Journal of Biological Chemistry | VOL. 276
Timothy J Griffin, et. al.Timothy J Griffin ... Ruedi Aebersold
01 Dec 2001
Journal of Biological Chemistry | VOL. 276

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

QuickMIRSeq: a pipeline for quick and accurate quantification of both known miRNAs and isomiRs by jointly processing multiple samples from microRNA sequencing

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics