A junction coverage compatibility score to quantify the reliability of transcript abundance estimates and annotation catalogs.

Charlotte Soneson,Dheeraj Malhotra,Shobbir Hussain,Rob Patro,Mark D Robinson,Michael I Love

doi:10.26508/lsa.201800175

Abstract

Most methods for statistical analysis of RNA-seq data take a matrix of abundance estimates for some type of genomic features as their input, and consequently the quality of any obtained results is directly dependent on the quality of these abundances. Here, we present the junction coverage compatibility score, which provides a way to evaluate the reliability of transcript-level abundance estimates and the accuracy of transcript annotation catalogs. It works by comparing the observed number of reads spanning each annotated splice junction in a genomic region to the predicted number of junction-spanning reads, inferred from the estimated transcript abundances and the genomic coordinates of the corresponding annotated transcripts. We show that although most genes show good agreement between the observed and predicted junction coverages, there is a small set of genes that do not. Genes with poor agreement are found regardless of the method used to estimate transcript abundances, and the corresponding transcript abundances should be treated with care in any downstream analyses.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Life Science Alliance	Publication Date: Jan 17, 2019
Citations: 20	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

A junction coverage compatibility score to quantify the reliability of transcript abundance estimates and annotation catalogs.

Abstract

Talk to us

Similar Papers

More From: Life Science Alliance

Lead the way for us

Similar Papers

Comparing the normalization methods for the differential analysis of Illumina high-throughput RNA-Seq data.
Peipei Li ... Ho Sun Shon
BMC Bioinformatics | VOL. 16
Peipei Li, et. al.Peipei Li ... Ho Sun Shon
28 Oct 2015
BMC Bioinformatics | VOL. 16

Computational methods for the identification and quantification of transcript isoforms from next generation sequencing data

-

01 Jan 2018
01 Jan 2018

Estimation of non-shivering thermogenesis and cold-induced nutrient oxidation rates: Impact of method for data selection and analysis
Guillermo Sanchez-Delgado ... Jonatan R Ruiz
Clinical Nutrition | VOL. 38
Guillermo Sanchez-Delgado, et. al.Guillermo Sanchez-Delgado ... Jonatan R Ruiz
18 Sep 2018
Clinical Nutrition | VOL. 38

LTE/LTE-A Network Security Data Collection and Analysis for Security Measurement: A Survey
Limei He ... Mohammed Atiquzzaman
IEEE Access | VOL. 6
Limei He, et. al.Limei He ... Mohammed Atiquzzaman
01 Jan 2018
IEEE Access | VOL. 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A junction coverage compatibility score to quantify the reliability of transcript abundance estimates and annotation catalogs.

Abstract

Talk to us

Similar Papers

More From: Life Science Alliance