Holistic optimization of an RNA-seq workflow for multi-threaded environments.

Ling-Hong Hung, Yuguang Xiong, Saranya Devi Athmalingam Ravishankar, Ka Yee Yeung, Radhika Agumbe Sridhar, Eric Sobie, Wes Lloyd

doi:10.1093/bioinformatics/btz169

Open Access

Abstract

For many next-generation sequencing pipelines, the most computationally intensive step is the alignment of reads to a reference sequence. As a result, alignment software such as the Burrows-Wheeler Aligner is optimized for speed and is often executed in parallel on the cloud. However, other, less demanding steps can also be optimized to significantly increase speed, especially when using many threads. We demonstrate this using a unique molecular identifier (UMI) RNA-sequencing pipeline consisting of three steps: split, align, and merge. Optimizing all three steps yields a 40% increase in speed when executed using a single thread. When executed using 16 threads, however, we observe a 4-fold improvement over the original parallel implementation and more than an 8-fold improvement over the original single-threaded implementation. In contrast, optimizing only the alignment step results in just a 13% improvement over the original parallel workflow using 16 threads. Code (MIT license), supporting scripts, and Dockerfiles are available at https://github.com/BioDepot/LINCS_RNAseq_cpp, and Docker images are available at https://hub.docker.com/r/biodepot/rnaseq-umi-cpp/. Supplementary data are available at Bioinformatics online.
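One way to see why the non-alignment steps matter more as the thread count grows is an Amdahl's-law style estimate. The sketch below is a toy model, not the authors' method: it assumes that each step either scales perfectly with the number of threads or runs on a single thread. The function name `workflow_time` and the per-step costs are hypothetical and chosen only for illustration; they are not measurements from the paper.

```python
def workflow_time(split, align, merge, threads, parallel_steps):
    """Wall-clock time of a split/align/merge workflow under an idealised model.

    Steps named in `parallel_steps` are assumed to scale perfectly with
    `threads`; every other step is assumed to run on a single thread.
    """
    step_costs = {"split": split, "align": align, "merge": merge}
    return sum(
        cost / threads if name in parallel_steps else cost
        for name, cost in step_costs.items()
    )


# Hypothetical per-step costs in arbitrary time units (NOT from the paper).
SPLIT, ALIGN, MERGE = 20.0, 70.0, 10.0

# Baseline: everything on one thread.
single = workflow_time(SPLIT, ALIGN, MERGE, threads=1, parallel_steps=set())

# Only the alignment step uses the 16 threads.
align_only = workflow_time(
    SPLIT, ALIGN, MERGE, threads=16, parallel_steps={"align"}
)

# All three steps use the 16 threads.
all_steps = workflow_time(
    SPLIT, ALIGN, MERGE, threads=16, parallel_steps={"split", "align", "merge"}
)

print(f"speedup with only align parallel: {single / align_only:.2f}x")
print(f"speedup with all steps parallel:  {single / all_steps:.2f}x")
```

In this model, once alignment is spread across 16 threads, the serial split and merge steps account for most of the remaining run time, so further gains in alignment alone have little effect on the total. Real workflows will not scale this cleanly, so the numbers indicate a trend rather than a prediction.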

Journal: Bioinformatics (Oxford, England)
Publication Date: Mar 11, 2019
Citations: 5
License type: CC BY-NC-ND
