CAFU: a Galaxy framework for exploring unmapped RNA-Seq data.

Siyuan Chen,Wenlong Ma,Zelong Li,Chengzhi Ren,Chuang Ma,Ting Zhang,Zhaoxue Han,Jingjing Zhai,Jiantao Yu,Xuyang Zhao

doi:10.1093/bib/bbz018

Siyuan Chen, Wenlong Ma + Show 8 more

Open Access

https://doi.org/10.1093/bib/bbz018

Copy DOI

Abstract

A widely used approach in transcriptome analysis is the alignment of short reads to a reference genome. However, owing to the deficiencies of specially designed analytical systems, short reads unmapped to the genome sequence are usually ignored, resulting in the loss of significant biological information and insights. To fill this gap, we present Comprehensive Assembly and Functional annotation of Unmapped RNA-Seq data (CAFU), a Galaxy-based framework that can facilitate the large-scale analysis of unmapped RNA sequencing (RNA-Seq) reads from single- and mixed-species samples. By taking advantage of machine learning techniques, CAFU addresses the issue of accurately identifying the species origin of transcripts assembled using unmapped reads from mixed-species samples. CAFU also represents an innovation in that it provides a comprehensive collection of functions required for transcript confidence evaluation, coding potential calculation, sequence and expression characterization and function annotation. These functions and their dependencies have been integrated into a Galaxy framework that provides access to CAFU via a user-friendly interface, dramatically simplifying complex exploration tasks involving unmapped RNA-Seq reads. CAFU has been validated with RNA-Seq data sets from wheat and Zea mays (maize) samples. CAFU is freely available via GitHub: https://github.com/cma2015/CAFU.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Briefings in Bioinformatics	Publication Date: Feb 28, 2019
Citations: 13	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

CAFU: a Galaxy framework for exploring unmapped RNA-Seq data.

Abstract

Talk to us

Similar Papers

More From: Briefings in Bioinformatics

Lead the way for us

Similar Papers

Another lesson from unmapped reads: in-depth analysis of RNA-Seq reads from various horse tissues.
Artur Gurgul ... Zbigniew Arent
Journal of applied genetics | VOL. 63
Artur Gurgul, et. al.Artur Gurgul ... Zbigniew Arent
07 Jun 2022
Journal of applied genetics | VOL. 63

Exploring the unmapped DNA and RNA reads in a songbird genome
Veronika N Laine ... Marcel E Visser
BMC Genomics | VOL. 20
Veronika N Laine, et. al.Veronika N Laine ... Marcel E Visser
08 Jan 2019
BMC Genomics | VOL. 20

What's in your next-generation sequence data? An exploration of unmapped DNA and RNA sequence reads from the bovine reference individual.
Lynsey K Whitacre ... Jeremy F Taylor
BMC Genomics | VOL. 16
Lynsey K Whitacre, et. al.Lynsey K Whitacre ... Jeremy F Taylor
01 Dec 2015
BMC Genomics | VOL. 16

The case for using mapped exonic non-duplicate reads when reporting RNA-sequencing depth: examples from pediatric cancer datasets.
Holly C Beale ... A Geoffrey Lyle
GigaScience | VOL. 10
Holly C Beale, et. al.Holly C Beale ... A Geoffrey Lyle
13 Mar 2021
GigaScience | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

CAFU: a Galaxy framework for exploring unmapped RNA-Seq data.

Abstract

Talk to us

Similar Papers

More From: Briefings in Bioinformatics