Abstract

In large DNA sequence repositories, archival data storage is often coupled with computers that provide 40 or more CPU threads and multiple GPU (general-purpose graphics processing unit) devices. This presents an opportunity for DNA sequence alignment software to exploit high-concurrency hardware to generate short-read alignments at high speed. Arioc, a GPU-accelerated short-read aligner, can compute WGS (whole-genome sequencing) alignments ten times faster than comparable CPU-only alignment software. When two or more GPUs are available, Arioc's speed increases proportionately because the software executes concurrently on each available GPU device. We have adapted Arioc to recent multi-GPU hardware architectures that support high-bandwidth peer-to-peer memory accesses among multiple GPUs. By modifying Arioc's implementation to exploit this GPU memory architecture, we obtained a further 1.8x-2.9x increase in overall alignment speed. With this additional acceleration, Arioc computes two million short-read alignments per second in a four-GPU system; it can align the reads from a human WGS sequencer run (over 500 million 150nt paired-end reads) in less than 15 minutes. As WGS data accumulates exponentially and high-concurrency computational resources become widespread, Arioc addresses a growing need for timely computation in the short-read data analysis toolchain.

Highlights

  • Short-read DNA sequencing technology has been estimated to generate 35 petabases of DNA sequencer data per year [1], with the amount of new data increasing exponentially [2]

  • High-throughput computational resources are becoming an integral part of the local computing environment in data centers that archive sequencing data

  • We carried out speed-versus-sensitivity experiments using all four sets of sequencer reads and all three lookup-table (LUT) layouts in GPU memory


Introduction

Short-read DNA sequencing technology has been estimated to generate 35 petabases of DNA sequencer data per year [1], with the amount of new data increasing exponentially [2]. When GPU memory is large enough to contain the aligner's lookup tables (LUTs) and peer-to-peer (P2P) memory interconnect is supported, LUT data accesses execute 10 or more times faster and the overall speed of the software increases.
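To make the P2P mechanism concrete, the following is a minimal sketch (not Arioc's actual code) of how software enables peer-to-peer access between GPU pairs using the standard CUDA runtime API. It assumes a multi-GPU system where the hardware (e.g., NVLink or a common PCIe root complex) permits direct peer access:

```cuda
// Hedged sketch: enabling P2P memory access between all GPU pairs.
// Uses only standard CUDA runtime calls; requires a CUDA-capable system.
#include <cstdio>
#include <cuda_runtime.h>

int main()
{
    int nDevices = 0;
    cudaGetDeviceCount(&nDevices);

    // For each ordered pair of GPUs, enable peer access when supported.
    for (int d = 0; d < nDevices; ++d)
    {
        cudaSetDevice(d);
        for (int peer = 0; peer < nDevices; ++peer)
        {
            if (peer == d) continue;

            int canAccess = 0;
            cudaDeviceCanAccessPeer(&canAccess, d, peer);
            if (canAccess)
            {
                // After this call, kernels running on device d can dereference
                // pointers allocated on device peer directly, so a lookup table
                // resident on one GPU becomes visible to its peers without
                // staging data through host memory.
                cudaDeviceEnablePeerAccess(peer, 0);
                printf("P2P enabled: GPU %d -> GPU %d\n", d, peer);
            }
        }
    }
    return 0;
}
```

Under this arrangement, a LUT can be partitioned or replicated across GPU memories while remaining directly addressable from every device, which is the memory-architecture property the speedup above depends on.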

Results
Conclusion
