MASA‐OpenCL: Parallel pruned comparison of long DNA sequences with OpenCL

Marco Antonio C Figueiredo, George L M Teodoro, Edans F Oliveira Sandes, Alba Cristina M A Melo, Genaina N Rodrigues

doi:10.1002/cpe.5039

Abstract

Biological sequence comparison is often used as an auxiliary task in the analysis of genetic material. Pairwise comparison algorithms such as Smith‐Waterman evaluate two strings representing protein, DNA, or RNA sequences to obtain the optimal alignment between them. Many applications have been proposed to address the sequence comparison problem, most of them prioritizing graphics cards and proprietary languages such as CUDA. In this paper, we propose and evaluate MASA‐OpenCL, an OpenCL solution for comparing long DNA sequences that is based on the MASA sequence alignment framework, with a pruning capability proportional to the similarity of the sequences compared. We compared MASA‐OpenCL with its CUDA counterpart (MASA‐CUDAlign) and, in most cases, MASA‐OpenCL achieved better performance. To better understand the behavior of MASA‐OpenCL, we performed a statistical analysis of 11 comparisons of sequences with high, medium, and low similarity on 4 GPUs. As a result, we obtained a multiple linear regression model that considers (a) the sizes of the sequences, (b) the similarity between them, (c) the computational power of the GPU, and (d) the GPU memory bandwidth. We used this model to predict the performance on two other GPUs, with low error rates.
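To make the pairwise comparison mentioned in the abstract concrete, the following is a minimal sequential sketch of Smith‐Waterman local alignment scoring. It is not the MASA‐OpenCL implementation: the function name, the scoring values (match, mismatch, gap), and the linear gap penalty are assumptions chosen for illustration, and the sketch performs no pruning and no GPU parallelization.

```python
def smith_waterman_score(a: str, b: str,
                         match: int = 1,
                         mismatch: int = -1,
                         gap: int = -2) -> int:
    """Return the best local alignment score between a and b.

    Illustrative only: a linear gap penalty and hypothetical scoring
    values are assumed; only two rows of the matrix are kept in memory.
    """
    prev = [0] * (len(b) + 1)  # previous row of the score matrix
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            # Score for aligning a[i-1] with b[j-1].
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # Local alignment: a cell never drops below zero.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


if __name__ == "__main__":
    print(smith_waterman_score("TTACGTTT", "GGACGGG"))
```

The score matrix has one cell per pair of positions, so the work grows with the product of the two sequence lengths. That quadratic cost is what makes long DNA comparisons expensive and what GPU parallelization and pruning aim to reduce.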

Journal: Concurrency and Computation: Practice and Experience
