Constructing large suffix trees on a computational grid

Chunxi Chen,Bertil Schmidt

doi:10.1016/j.jpdc.2006.08.004

Abstract

The suffix tree is a key data structure for biological sequence analysis, since it permits efficient solutions to many string-based problems. Constructing large suffix trees is challenging because of high memory overheads and poor memory locality. Even though efficient suffix tree construction algorithms exist, their run-time is still very high for long DNA sequences such as whole human chromosomes. In this paper, we are using a hierarchical grid system as a computational platform in order to reduce this run-time significantly. To achieve an efficient mapping onto this type of architecture we introduce a parallel suffix tree construction algorithm that makes use of a new data structure called the common prefix suffix tree. Using this algorithm together with a dynamic load balancing strategy we show that our distributed grid implementation leads to significant run-time savings.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Constructing large suffix trees on a computational grid

Abstract

Talk to us

Similar Papers

More From: Journal of Parallel and Distributed Computing

Lead the way for us

Journal: Journal of Parallel and Distributed Computing	Publication Date: Sep 28, 2006
Citations: 6

Similar Papers

Parallel Construction of Large Suffix Trees on a PC Cluster
Chunxi Chen ... Bertil Schmidt
-
Chunxi Chen, et. al.Chunxi Chen ... Bertil Schmidt
01 Jan 2004
01 Jan 2004

Constructing suffix tree for gigabyte sequences with megabyte memory
Ching-Fung Cheung ... Hongjun Lu
IEEE Transactions on Knowledge and Data Engineering | VOL. 17
Ching-Fung Cheung, et. al. Ching-Fung Cheung ... Hongjun Lu
01 Jan 2004
IEEE Transactions on Knowledge and Data Engineering | VOL. 17

Practical methods for constructing suffix trees
Yuanyuan Tian ... Richard A Hankins
The VLDB Journal | VOL. 14
Yuanyuan Tian, et. al.Yuanyuan Tian ... Richard A Hankins
01 Sep 2005
The VLDB Journal | VOL. 14

I/O efficient algorithms for serial and parallel suffix tree construction
Amol Ghoting ... Konstantin Makarychev
ACM Transactions on Database Systems | VOL. 35
Amol Ghoting, et. al.Amol Ghoting ... Konstantin Makarychev
12 Oct 2010
ACM Transactions on Database Systems | VOL. 35

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Constructing large suffix trees on a computational grid

Abstract

Talk to us

Similar Papers

More From: Journal of Parallel and Distributed Computing