A Data Parallel Strategy for Aligning Multiple Biological Sequences on Homogeneous Multiprocessor Platform

Xiangyuan Zhu, Kenli Li, Renfa Li

doi:10.1109/chinagrid.2011.42

Abstract

In this paper, we address the biological sequence alignment problem, a fundamental operation in computational biology. We employ the data parallelism paradigm, which is well suited to large-scale processing, to achieve a high degree of parallelism. We propose a strategy that uses a parallel clustering scheme to partition the set of sequences into subsets based on sequence similarity. The subsets are then distributed among the processors by a heuristic algorithm based on Integer Programming so as to minimize the overall processing time, and each subset can be aligned independently, in parallel, using any sequential approach. The global alignment is obtained by progressive profile-profile alignment within and between the processors. We implement the proposed algorithm on a cluster using the MPI library and analyze the experimental results for different problem sizes in terms of alignment quality, execution time, and speed-up.
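The distribution step can be illustrated with a small sketch. The abstract does not specify the Integer Programming heuristic, so the code below substitutes a different, well-known technique: greedy longest-processing-time (LPT) scheduling, which assigns the most expensive subset to the currently least-loaded processor. The function name `assign_subsets`, the quadratic cost model, and the example subset sizes are all assumptions made for illustration, not details from the paper.

```python
import heapq


def assign_subsets(subset_sizes, num_procs, cost=lambda n: n * n):
    """Greedy LPT assignment of sequence subsets to processors.

    subset_sizes: number of sequences in each subset (hypothetical input).
    cost: assumed cost model; quadratic in subset size is only a guess.
    Returns (assignment, makespan), where assignment maps a processor id
    to the list of subset indices it receives.
    """
    # Min-heap of (current_load, processor_id): the root is the least-loaded processor.
    heap = [(0, p) for p in range(num_procs)]
    heapq.heapify(heap)
    assignment = {p: [] for p in range(num_procs)}

    # Visit subsets from most to least expensive.
    order = sorted(range(len(subset_sizes)),
                   key=lambda i: cost(subset_sizes[i]),
                   reverse=True)
    for i in order:
        load, p = heapq.heappop(heap)
        assignment[p].append(i)
        heapq.heappush(heap, (load + cost(subset_sizes[i]), p))

    # The overall processing time is set by the most heavily loaded processor.
    makespan = max(load for load, _ in heap)
    return assignment, makespan


if __name__ == "__main__":
    assignment, makespan = assign_subsets([10, 8, 6, 4, 2], num_procs=2)
    print(assignment, makespan)
```

LPT is a simple approximation for minimizing the maximum load on identical processors, which matches the homogeneous-platform setting; an Integer Programming based heuristic such as the one the authors describe may produce different assignments.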

Similar Papers

A domain decomposition strategy for alignment of multiple biological sequences on multiprocessor platforms
Fahad Saeed ... Ashfaq Khokhar
Journal of Parallel and Distributed Computing | VOL. 69
05 Apr 2009

A data parallel strategy for aligning multiple biological sequences on multi-core computers
Xiangyuan Zhu ... Ahmad Salah
Computers in Biology and Medicine | VOL. 43
14 Feb 2013

Comparing Integer Linear Programming to SAT-Solving for Hard Problems in Computational and Systems Biology
Hannah Brown ... Dan Gusfield
01 Jan 2020

Graphic user interface based implementation of longest common subsequence problem in DNA sequencing
Arpan Kumar ... Sarbajit Manna
01 Jan 2020
