Next-generation massively parallel short-read mapping on FPGAs

Oliver Knodel, Rainer G Spallek, Thomas B Preusser

doi:10.1109/asap.2011.6043268

Abstract

The mapping of DNA sequences to huge genome databases is an essential analysis task in modern molecular biology. With linearized reference genomes available, aligning the short DNA reads obtained from sequencing an individual genome against such a database provides a powerful diagnostic and analysis tool. In essence, this task amounts to a simple string search that tolerates a certain number of mismatches to account for the diversity of individuals. The complexity of this process arises from the sheer size of the reference genome. It is further amplified by current next-generation sequencing technologies, which produce a huge number of increasingly short reads. These short reads severely degrade the effectiveness of established alignment heuristics such as BLAST. This paper proposes an FPGA-based custom computation that aligns short DNA reads in a timely manner by exploiting tremendous concurrency at reasonable cost. The special measures taken to achieve an extremely efficient and compact mapping of the computation to a Xilinx FPGA architecture are described. The presented approach also surpasses all software heuristics in the quality of its results: it guarantees to find all alignment locations of a read in the database while allowing a freely adjustable character mismatch threshold. In contrast, advanced fast alignment heuristics such as Bowtie and Maq tolerate only small mismatch maxima, and their probability of detecting existing valid alignments deteriorates quickly. The performance comparison with these widely used software tools also demonstrates that the proposed FPGA computation achieves its guaranteed exact results in very competitive time.
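The "simple string search tolerating a certain number of mismatches" described in the abstract can be illustrated with a minimal sequential sketch in Python. This is not the authors' FPGA design, which the abstract does not detail. It only shows the exhaustive search semantics: every position in the reference is checked, so every alignment within the mismatch threshold is reported. The function names `hamming_within` and `find_alignments` are hypothetical, and the sketch assumes mismatches are character substitutions only (no insertions or deletions).

```python
from typing import List


def hamming_within(reference: str, start: int, read: str, max_mismatches: int) -> bool:
    """Return True if `read` matches reference[start:start + len(read)]
    with at most `max_mismatches` substituted characters."""
    mismatches = 0
    for offset, base in enumerate(read):
        if reference[start + offset] != base:
            mismatches += 1
            # Stop early once the threshold is exceeded.
            if mismatches > max_mismatches:
                return False
    return True


def find_alignments(reference: str, read: str, max_mismatches: int) -> List[int]:
    """Return every start position where `read` aligns to `reference`
    within the mismatch threshold (exhaustive, so no hit is missed)."""
    last_start = len(reference) - len(read)
    return [
        pos
        for pos in range(last_start + 1)
        if hamming_within(reference, pos, read, max_mismatches)
    ]


if __name__ == "__main__":
    # Toy data chosen for illustration only.
    print(find_alignments("ACGTACGTTACG", "ACGA", max_mismatches=1))
```

Each candidate position is checked independently of all others, which is what makes this kind of search amenable to massive concurrency. In software the cost grows with the product of reference length and read length, which hints at why heuristics dominate on CPUs.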
