Leveraging multiple transcriptome assembly methods for improved gene structure annotation.

Luca Venturini,Daniel Lee Mapleson,David Swarbreck,Shabhonam Caim,Gemy George Kaithakottil

doi:10.1093/gigascience/giy093

Luca Venturini, Daniel Lee Mapleson + Show 3 more

Open Access

https://doi.org/10.1093/gigascience/giy093

Copy DOI

Abstract

ABSTRACTBackgroundThe performance of RNA sequencing (RNA-seq) aligners and assemblers varies greatly across different organisms and experiments, and often the optimal approach is not known beforehand.ResultsHere, we show that the accuracy of transcript reconstruction can be boosted by combining multiple methods, and we present a novel algorithm to integrate multiple RNA-seq assemblies into a coherent transcript annotation. Our algorithm can remove redundancies and select the best transcript models according to user-specified metrics, while solving common artifacts such as erroneous transcript chimerisms.ConclusionsWe have implemented this method in an open-source Python3 and Cython program, Mikado, available on GitHub.

Highlights

The performance of RNA sequencing (RNA-seq) aligners and assemblers varies greatly across different organisms and experiments, and often the optimal approach is not known beforehand
In line with the previous RGASP evaluation, we performed our tests on the three metazoan species of Caenhorabditis elegans, Drosophila melanogaster, and Homo sapiens using RNA-seq data from that study as input
Transcriptome assembly is a crucial component of genome annotation workflows; correctly reconstructing transcripts from short RNA-seq reads remains a challenging task

Summary

Introduction

The performance of RNA sequencing (RNA-seq) aligners and assemblers varies greatly across different organisms and experiments, and often the optimal approach is not known beforehand. For many of these species, there are only minimal expressed sequence tag (EST) and cDNA resources and limited availability of proteins from closely related species In these cases, transcriptome data from high-throughput RNA sequencing (RNA-seq) provides a vital source of evidence to aid gene structure annotation. Many approaches developed for this purpose leverage genomic alignments [9,10,11,12], there are alternatives based instead on de novo assembly [10, 13, 14] While these methods focus on how to analyze a single dataset, related research has examined how to integrate assemblies from multiple samples. While some researchers advocate for merging together reads from multiple samples and assembling them jointly [10], others have developed methods to integrate multiple assemblies into a single coherent annotation [9, 15]

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: GigaScience	Publication Date: Jul 24, 2018
Citations: 132	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Leveraging multiple transcriptome assembly methods for improved gene structure annotation.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: GigaScience

Lead the way for us

Similar Papers

GenomeQC: a quality assessment tool for genome assemblies and gene structure annotations
Nancy Manchanda ... Margaret R Woodhouse
BMC Genomics | VOL. 21
Nancy Manchanda, et. al.Nancy Manchanda ... Margaret R Woodhouse
02 Mar 2020
BMC Genomics | VOL. 21

DNA-BOT: a low-cost, automated DNA assembly platform for synthetic biology.
Marko Storch ... Matthew C Haines
Synthetic Biology | VOL. 5
Marko Storch, et. al.Marko Storch ... Matthew C Haines
01 Jan 2020
Synthetic Biology | VOL. 5

A comparative analysis of methods for de novo assembly of hymenopteran genomes using either haploid or diploid samples
Tal Yahav ... Eyal Privman
Scientific Reports | VOL. 9
Tal Yahav, et. al.Tal Yahav ... Eyal Privman
24 Apr 2019
Scientific Reports | VOL. 9

Automated ensemble assembly and validation of microbial genomes.
Sergey Koren ... Adam M Phillippy
BMC Bioinformatics | VOL. 15
Sergey Koren, et. al.Sergey Koren ... Adam M Phillippy
03 May 2014
BMC Bioinformatics | VOL. 15

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Leveraging multiple transcriptome assembly methods for improved gene structure annotation.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: GigaScience