Improving Performance of Dynamic Programming via Parallelism and Locality on Multicore Architectures

Guangming Tan Guangming Tan,G.R Gao,Ninghui Sun Ninghui Sun

doi:10.1109/tpds.2008.78

Abstract

Dynamic programming (DP) is a popular technique which is used to solve combinatorial search and optimization problems. This paper focuses on one type of DP, which is called nonserial polyadic dynamic programming (NPDP). Owing to the nonuniform data dependencies of NPDP, it is difficult to exploit either parallelism or locality. Worse still, the emerging multi/many-core architectures with small on-chip memory make these issues more challenging. In this paper, we address the challenges of exploiting the fine grain parallelism and locality of NPDP on multicore architectures. We describe a latency-tolerant model and a percolation technique for programming on multicore architectures. On an algorithmic level, both parallelism and locality do benefit from a specific data dependence transformation of NPDP. Next, we propose a parallel pipelining algorithm by decomposing computation operators and percolating data through a memory hierarchy to create just-in-time locality. In order to predict the execution time, we formulate an analytical performance model of the parallel algorithm. The parallel pipelining algorithm achieves not only high scalability on the 160-core IBM Cyclops64, but portable performance as well, across the 8-core Sun Niagara and quad-cores Intel Clovertown.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Improving Performance of Dynamic Programming via Parallelism and Locality on Multicore Architectures

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Parallel and Distributed Systems

Lead the way for us

Journal: IEEE Transactions on Parallel and Distributed Systems	Publication Date: Feb 1, 2009
Citations: 63

Similar Papers

A parallel dynamic programming algorithm on a multi-core architecture
Guangming Tan ... Guang R Gao
-
Guangming Tan, et. al.Guangming Tan ... Guang R Gao
09 Jun 2007
09 Jun 2007

Adjusting Thread Parallelism Dynamically to Accelerate Dynamic Programming with Irregular Workload Distribution on GPGPUs
Chao-Chin Wu ... Syun-Sheng Jhan
International Journal of Grid and High Performance Computing | VOL. 6
Chao-Chin Wu, et. al.Chao-Chin Wu ... Syun-Sheng Jhan
01 Jan 2014
International Journal of Grid and High Performance Computing | VOL. 6

Efficient Nonserial Polyadic Dynamic Programming on the Cell Processor
Li Liu ... Ruizhe Li
-
Li Liu, et. al.Li Liu ... Ruizhe Li
01 May 2011
01 May 2011

Optimizing Dynamic Programming on Graphics Processing Units Via Data Reuse and Data Prefetch with Inter-Block Barrier Synchronization
Chao-Chin Wu ... Kai-Cheng Wei
-
Chao-Chin Wu, et. al.Chao-Chin Wu ... Kai-Cheng Wei
01 Dec 2012
01 Dec 2012

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improving Performance of Dynamic Programming via Parallelism and Locality on Multicore Architectures

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Parallel and Distributed Systems