Ranked choice voting for representative transcripts with TRaCE.

Andrew J Olson, Doreen Ware, Janet Kelso

doi:10.1093/bioinformatics/btab542

Abstract

Summary: Genome sequencing projects annotate protein-coding gene models with multiple transcripts, aiming to represent all of the available transcript evidence. However, downstream analyses often operate on only one representative transcript per gene locus, sometimes known as the canonical transcript. To choose canonical transcripts, Transcript Ranking and Canonical Election (TRaCE) holds an ‘election’ in which a set of RNA-seq samples rank transcripts by annotation edit distance. These sample-specific votes are tallied along with other criteria such as protein length and InterPro domain coverage. The winner is selected as the canonical transcript, but the election proceeds through multiple rounds of voting to order all the transcripts by relevance. Based on the set of expression data provided, TRaCE can identify the most common isoforms from a broad expression atlas or prioritize alternative transcripts expressed in specific contexts.

Availability and implementation: Transcript ranking code can be found on GitHub at https://github.com/warelab/TRaCE.

Supplementary information: Supplementary data are available at Bioinformatics online.
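The election described in the Summary can be illustrated with a minimal Python sketch. This is not the TRaCE implementation. It assumes that each RNA-seq sample casts one vote per round for the remaining transcript with the lowest annotation edit distance (AED), that the transcript with the most votes wins the round, and that ties are broken by InterPro domain coverage and then protein length. The function name `rank_transcripts`, the input layout, the tie-break order, and the example values are all hypothetical.

```python
from collections import Counter


def rank_transcripts(aed_by_sample, protein_length, domain_coverage):
    """Order the transcripts of one gene by repeated voting rounds.

    aed_by_sample:   {sample: {transcript: AED}}; lower AED means the
                     transcript model fits that sample's evidence better.
    protein_length:  {transcript: length in amino acids}
    domain_coverage: {transcript: fraction of protein covered by domains}
    The tie-break order used here is an assumption, not the TRaCE rule.
    """
    remaining = set(protein_length)
    ordering = []
    while remaining:
        votes = Counter()
        for aed in aed_by_sample.values():
            candidates = [t for t in remaining if t in aed]
            if candidates:
                # Each sample votes for its best-supported remaining transcript.
                best = min(candidates, key=lambda t: (aed[t], t))
                votes[best] += 1
        # Most votes wins the round; ties fall back to domain coverage,
        # then protein length, then transcript name for determinism.
        winner = max(
            remaining,
            key=lambda t: (votes[t], domain_coverage.get(t, 0.0),
                           protein_length[t], t),
        )
        ordering.append(winner)
        remaining.remove(winner)  # later rounds rank the rest
    return ordering


# Hypothetical AED values for three transcripts in three samples.
aed = {
    "leaf":   {"T1": 0.1, "T2": 0.4, "T3": 0.9},
    "root":   {"T1": 0.2, "T2": 0.3, "T3": 0.8},
    "pollen": {"T1": 0.7, "T2": 0.2, "T3": 0.6},
}
length = {"T1": 300, "T2": 280, "T3": 120}
coverage = {"T1": 0.9, "T2": 0.9, "T3": 0.4}

print(rank_transcripts(aed, length, coverage))  # ['T1', 'T2', 'T3']
```

In this toy input, T1 wins the first round with two of three votes and would be the canonical transcript. Restricting `aed` to the pollen sample alone would instead place T2 first, which mirrors how the choice of expression data can prioritize context-specific isoforms.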

Highlights

Genome sequencing projects often use complex, automated annotation pipelines to build reference sets of gene models
The winner is selected as the canonical transcript, but the election proceeds through multiple rounds of voting to order all the transcripts by relevance
Before a project releases a set of high-confidence gene models, additional filtering steps may remove transcript models that lack homology or are subject to nonsense-mediated degradation (NMD)

Summary

Introduction

Genome sequencing projects often use complex, automated annotation pipelines to build reference sets of gene models. These pipelines mask repeats in the assembled genome, align protein and transcript evidence, and build gene models by aggregating overlapping alignments that adhere to known or inferred splice site patterns (Hoff et al. 2019; Campbell et al. 2014; Haas et al. 2003). [...] and new sequencing technology such as PacBio IsoSeq can capture splice variants at an unprecedented scale (Wang et al. 2016; Zhang et al. 2019; Bruijnesteijn et al. 2018). This heightened sensitivity can lead to the detection of transcriptional noise, which gene builders can misreport as biologically relevant. Partially processed transcripts containing retained introns that neither disrupt the reading frame nor introduce stop codons can even be promoted to canonical transcripts (Figure 1).

Journal: Bioinformatics (Oxford, England)	Publication Date: Jul 23, 2021
Citations: 4	License type: CC BY 4.0
