Benchmark analysis of algorithms for determining and quantifying full-length mRNA splice forms from RNA-seq data.

Katharina E Hayer,Angel Pizarro,Gregory R Grant,Nicholas F Lahens,John B Hogenesch

doi:10.1093/bioinformatics/btv488

Katharina E Hayer, Angel Pizarro + Show 3 more

Open Access

https://doi.org/10.1093/bioinformatics/btv488

Copy DOI

Abstract

Motivation: Because of the advantages of RNA sequencing (RNA-Seq) over microarrays, it is gaining widespread popularity for highly parallel gene expression analysis. For example, RNA-Seq is expected to be able to provide accurate identification and quantification of full-length splice forms. A number of informatics packages have been developed for this purpose, but short reads make it a difficult problem in principle. Sequencing error and polymorphisms add further complications. It has become necessary to perform studies to determine which algorithms perform best and which if any algorithms perform adequately. However, there is a dearth of independent and unbiased benchmarking studies. Here we take an approach using both simulated and experimental benchmark data to evaluate their accuracy.Results: We conclude that most methods are inaccurate even using idealized data, and that no method is highly accurate once multiple splice forms, polymorphisms, intron signal, sequencing errors, alignment errors, annotation errors and other complicating factors are present. These results point to the pressing need for further algorithm development.Availability and implementation: Simulated datasets and other supporting information can be found at http://bioinf.itmat.upenn.edu/BEERS/bp2Supplementary information: Supplementary data are available at Bioinformatics online.Contact: hayer@upenn.edu

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Bioinformatics	Publication Date: Sep 3, 2015
Citations: 97	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Benchmark analysis of algorithms for determining and quantifying full-length mRNA splice forms from RNA-seq data.

Abstract

Talk to us

Similar Papers

More From: Bioinformatics

Lead the way for us

Similar Papers

Genome-wide signals of positive selection in strongylocentrotid sea urchins
Kord M Kober ... Grant H Pogson
BMC Genomics | VOL. 18
Kord M Kober, et. al.Kord M Kober ... Grant H Pogson
21 Jul 2017
BMC Genomics | VOL. 18

Assessment of kinship detection using RNA-seq data.
Natalia Blay ... Iván Galván-Femenía
Nucleic Acids Research | VOL. 47
Natalia Blay, et. al.Natalia Blay ... Iván Galván-Femenía
10 Sep 2019
Nucleic Acids Research | VOL. 47

High-throughput RNA sequencing: a step forward in transcriptome analysis

-

25 Feb 2016
25 Feb 2016

Sequencing accuracy and systematic errors of nanopore direct RNA sequencing
Wang Liu-Wei ... Redmond P Smyth
BMC genomics | VOL. 25
Wang Liu-Wei, et. al.Wang Liu-Wei ... Redmond P Smyth
28 May 2024
BMC genomics | VOL. 25

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Benchmark analysis of algorithms for determining and quantifying full-length mRNA splice forms from RNA-seq data.

Abstract

Talk to us

Similar Papers

More From: Bioinformatics