A Conceptual Framework for Abundance Estimation of Genomic Targets in the Presence of Ambiguous Short Sequencing Reads

Katarzyna Górczak,Jürgen Claesen,Tomasz Burzykowski

doi:10.1089/cmb.2019.0272

Abstract

RNA sequencing (RNA-seq) is widely used to study gene-, transcript-, or exon expression. To quantify the expression level, millions of short sequenced reads need to be mapped back to a reference genome or transcriptome. Read mapping makes it possible to find a location to which a read is identical or similar. Based upon this alignment, expression summaries, that is, read counts are generated. However, reads may be matched to multiple locations. Such ambiguously mapped reads are often ignored in the analysis, which is a potential loss of information and may cause bias in expression estimation. We present the general principles underlying multiread allocation and unbiased estimation of the expression level of genes, exons, or transcripts in the presence of multiple mapped reads. The underlying principles are derived from a theoretical concept that identifies important sources of information such as the number of uniquely mapped reads, the total target length, and the length of the shared target regions. We show with simulation studies that methods incorporating some or all of the aforementioned sources of information estimate the expression levels of genes, exons, and/or transcripts with a higher precision and accuracy than methods that do not use this information. We identify important sources of information that should be taken into account by methods that estimate the abundance of genes, exons, and/or transcripts to achieve good precision and accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Conceptual Framework for Abundance Estimation of Genomic Targets in the Presence of Ambiguous Short Sequencing Reads

Abstract

Talk to us

Similar Papers

More From: Journal of Computational Biology

Lead the way for us

Journal: Journal of Computational Biology	Publication Date: Dec 31, 2019
Citations: 1

Similar Papers

An improved approach for reconstructing consensus repeats from short sequence reads
Chong Chu ... Jingwen Pei
BMC genomics | VOL. 19
Chong Chu, et. al.Chong Chu ... Jingwen Pei
01 Aug 2018
BMC genomics | VOL. 19

De novo assembly of short sequence reads
K Paszkiewicz ... D J Studholme
Briefings in Bioinformatics | VOL. 11
K Paszkiewicz, et. al.K Paszkiewicz ... D J Studholme
19 Aug 2010
Briefings in Bioinformatics | VOL. 11

Microindel detection in short-read sequence data
Peter Krawitz ... Marten Jäger
Bioinformatics | VOL. 26
Peter Krawitz, et. al.Peter Krawitz ... Marten Jäger
09 Feb 2010
Bioinformatics | VOL. 26

G-SNPM - A GPU-based SNP mapping tool
Alessandro Orro ... Andrea Manconi
EMBnet.journal | VOL. 18
Alessandro Orro, et. al.Alessandro Orro ... Andrea Manconi
09 Nov 2012
EMBnet.journal | VOL. 18

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Conceptual Framework for Abundance Estimation of Genomic Targets in the Presence of Ambiguous Short Sequencing Reads

Abstract

Talk to us

Similar Papers

More From: Journal of Computational Biology