A multi-split mapping algorithm for circular RNA, splicing, trans-splicing and fusion detection.

Steve Hoffmann,Jörg Hackermüller,Daniel Teupser,Sabina Christ,Peter F Stadler,David Langenberger,Christian Otto,Lesca M Holdt,Gero Doose,Andrea Tanzer,Manfred Kunz

doi:10.1186/gb-2014-15-2-r34

Abstract

Numerous high-throughput sequencing studies have focused on detecting conventionally spliced mRNAs in RNA-seq data. However, non-standard RNAs arising through gene fusion, circularization or trans-splicing are often neglected. We introduce a novel, unbiased algorithm to detect splice junctions from single-end cDNA sequences. In contrast to other methods, our approach accommodates multi-junction structures. Our method compares favorably with competing tools for conventionally spliced mRNAs and, with a gain of up to 40% of recall, systematically outperforms them on reads with multiple splits, trans-splicing and circular products. The algorithm is integrated into our mapping tool segemehl (http://www.bioinf.uni-leipzig.de/Software/segemehl/).

Highlights

The term splicing refers to a post-transcriptional process in which the raw transcript is cleaved from intronic DNA fragments
While the overwhelming majority of splicing events occurs within the same pre-mRNA at consensus splice sites, some mRNAs are spliced at non-consensus sites
For a read of length m, the algorithm evaluates the best alignments with a limited number of mismatches, insertions and deletions for all 2(m − ) suffixes of the read and its reverse complement, where is the minimum suffix length

Summary

Introduction

The term splicing refers to a post-transcriptional process in which the raw transcript (pre-mRNA) is cleaved from intronic DNA fragments. Many transcripts derived at non-consensus splice sites may have escaped detection in the past because of the assumptions built into the in silico analysis pipelines or due to the limited throughput of earlier RNA sequencing (RNA-seq) protocols. The original version of TopHat [9] predicts exon locations from the coverage data and attempts split read alignments across neighboring exons. This algorithm was not able to detect fusion events, so a new algorithm, TopHat-Fusion [10], was published and has since been integrated into TopHat along with some other modifications to the original algorithm. The tags are aligned to exons and junctions inferred from tags mapping to consecutive exons

Objectives

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Genome Biology	Publication Date: Jan 1, 2014
Citations: 285	License type: cc-by

R Discovery Prime

R Discovery Prime

A multi-split mapping algorithm for circular RNA, splicing, trans-splicing and fusion detection.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Genome Biology

Lead the way for us

Similar Papers

Comprehensive Multi-Omics Analysis of Gene Fusions in a Large Multiple Myeloma Cohort
Steven M Foltz ... Li Ding
Blood | VOL. 132
Steven M Foltz, et. al.Steven M Foltz ... Li Ding
29 Nov 2018
Blood | VOL. 132

Noncanonical Gene Fusions Detected at the DNA Level Necessitate Orthogonal Diagnosis Methods Before Targeted Therapy.
Zhengbo Song ... Chenyu Lu
Journal of Thoracic Oncology | VOL. 16
Zhengbo Song, et. al.Zhengbo Song ... Chenyu Lu
01 Mar 2021
Journal of Thoracic Oncology | VOL. 16

Abstract 5478: CICERO: An accurate method for detecting complex and diverse driver fusions using cancer transcriptome sequencing (RNA-seq) data
Liqing Tian ... David W Ellison
Cancer Research | VOL. 80
Liqing Tian, et. al.Liqing Tian ... David W Ellison
13 Aug 2020
Cancer Research | VOL. 80

Diagnostic Validation of a Clinical Laboratory-Oriented Targeted RNA Sequencing System As a Comprehensive Assay for Hematologic Malignancies
Ha Jin Lim ... Myung-Geun Shin
Blood | VOL. 136
Ha Jin Lim, et. al.Ha Jin Lim ... Myung-Geun Shin
05 Nov 2020
Blood | VOL. 136

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A multi-split mapping algorithm for circular RNA, splicing, trans-splicing and fusion detection.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Genome Biology