rnaSPAdes: a de novo transcriptome assembler and its application to RNA-Seq data

Elena Bushmanova, Alla Lapidus, Andrey D Prjibelski, Dmitry Antipov

doi:10.1093/gigascience/giz100


Open Access

https://doi.org/10.1093/gigascience/giz100


Journal: GigaScience
Publication Date: Sep 1, 2019
Citations: 507
License type: CC BY 4.0

Affiliation: St Petersburg University

Abstract

Background: The ability to generate large RNA-sequencing datasets has led to the development of various reference-based and de novo transcriptome assemblers, each with its own strengths and limitations. While reference-based tools are widely used in transcriptomic studies, their application is limited to organisms with finished and well-annotated genomes. De novo transcriptome reconstruction from short reads remains an open and challenging problem, complicated by varying expression levels across genes, alternative splicing, and paralogous genes.

Results: Herein we describe the novel transcriptome assembler rnaSPAdes, which has been developed on top of the SPAdes genome assembler and explores computational parallels between the assembly of transcriptomes and single-cell genomes. We also present quality assessment reports for rnaSPAdes assemblies, compare it with modern transcriptome assembly tools using several evaluation approaches on various RNA-sequencing datasets, and briefly highlight the strong and weak points of different assemblers.

Conclusions: Based on the comparison of different assembly methods, we infer that it is not possible to identify an absolute leader across all quality metrics and all datasets used. However, rnaSPAdes typically outperforms other assemblers in important properties such as the number of assembled genes and isoforms, while also having higher accuracy statistics on average than its closest competitors.

Highlights

While reference-based methods for RNA-Seq analysis [5,7,10,11,16,23] currently dominate transcriptome studies, they are subject to the following constraints: (i) they are not applicable when the genome is unknown, (ii) their performance deteriorates when the genome sequence or annotation is incomplete, and (iii) they may miss unusual transcripts even when the reference genome is available.
De novo transcriptome assemblers [6,15,19,20,25] have emerged as a viable complement to the reference-based tools.
While transcriptome assembly may seem to be a simpler problem than genome assembly, RNA-Seq assemblers have to address the complications arising from highly uneven read coverage depth caused by variations in gene expression levels.

Summary

Introduction

While reference-based methods for RNA-Seq analysis [5,7,10,11,16,23] currently dominate transcriptome studies, they are subject to the following constraints: (i) they are not applicable when the genome is unknown, (ii) their performance deteriorates when the genome sequence or annotation is incomplete, and (iii) they may miss unusual transcripts (such as fusion genes or genes with short unannotated exons) even when the reference genome is available. To address these constraints, de novo transcriptome assemblers [6,15,19,20,25] have emerged as a viable complement to the reference-based tools. Even though SPAdes is a genome assembler and was not optimized for RNA-Seq data, in some cases it generated decent assemblies of quality comparable to that of state-of-the-art transcriptome assemblers.

