The Selection of Quantification Pipelines for Illumina RNA-seq Data Using a Subsampling Approach.

Po-Yen Wu,May D Wang

doi:10.1109/bhi.2016.7455839

Abstract

RNA sequencing, or (RNA-seq for short,, is a widely applied technology that for extractings gene and transcript expression from biological samples. Given numerous quantification pipelines for RNA-seq data, one fundamental challenge is to determine identify a pipeline that can produce the most accurate estimate the most accurate gene and/or transcript expression. Exploring all available pipelines requires tremendous extensive computational resources, so. Therefore, we propose to use a subsampling approach that can improve speed up the pipeline evaluation and selection the efficiency process of pipeline performance evaluation for a given RNA-seq dataset. We applied our approach to one simulated and two real RNA-seq datasets and found that expression estimates derived from subsampled data are close surrogates for those derived from original data. In addition, the ranking of quantification pipelines based on the subsampled data was highly correlated concordant with that based on the original data. Therefore, we conclude that subsampling is a valid approach to facilitating efficient quantification pipeline selection using RNA-seq data.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

The Selection of Quantification Pipelines for Illumina RNA-seq Data Using a Subsampling Approach.

Abstract

Talk to us

Similar Papers

More From: ... IEEE-EMBS International Conference on Biomedical and Health Informatics. IEEE-EMBS International Conference on Biomedical and Health Informatics

Lead the way for us

Journal: ... IEEE-EMBS International Conference on Biomedical and Health Informatics. IEEE-EMBS International Conference on Biomedical and Health Informatics	Publication Date: Feb 1, 2016
Citations: 9

Similar Papers

Performance evaluation of lossy quality compression algorithms for RNA-seq data
Rongshan Yu ... Wenxian Yang
BMC bioinformatics | VOL. 21
Rongshan Yu, et. al.Rongshan Yu ... Wenxian Yang
20 Jul 2020
BMC bioinformatics | VOL. 21

A comparative study of RNA-Seq and microarray data analysis on the two examples of rectal-cancer patients and Burkitt Lymphoma cells.
Alexander Wolff ... Petr V Nazarov
PloS one | VOL. 13
Alexander Wolff, et. al.Alexander Wolff ... Petr V Nazarov
16 May 2018
PloS one | VOL. 13

Abstract 1817: Differential expression of long non-coding RNA in colon adenocarcinoma RNA-sequence data set
Stephen J O'Brien ... Theodore Kalbfleisch
American Journal of Cancer | VOL. 79
Stephen J O'Brien, et. al.Stephen J O'Brien ... Theodore Kalbfleisch
01 Jul 2019
Abstract 1817: Differential expression of long non-coding RNA in colon adenocarcinoma RNA-sequence data set
Stephen J O'Brien ... Theodore Kalbfleisch

Selecting Classification Methods for Small Samples of Next-Generation Sequencing Data.
Jiadi Zhu ... Lianjie Shu
Frontiers in Genetics | VOL. 12
Jiadi Zhu, et. al.Jiadi Zhu ... Lianjie Shu
04 Mar 2021
Frontiers in Genetics | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The Selection of Quantification Pipelines for Illumina RNA-seq Data Using a Subsampling Approach.

Abstract

Talk to us

Similar Papers

More From: ... IEEE-EMBS International Conference on Biomedical and Health Informatics. IEEE-EMBS International Conference on Biomedical and Health Informatics