An Improvement to Affine Decomposition on Distributed Memory Architecture

Ding Rui, Zhao Rongcai, Liu Xiaoxian

doi:10.1109/dcabes.2012.9

Abstract

Automatic decomposition is an optimization technique that distributes computation and data across processors, and the quality of the decomposition directly affects the performance of a parallel program. Because every computing node in a distributed memory parallel computer (DMPC) has its own memory, false dependences do not hinder parallelism. Affine decomposition is an effective method for representing and deriving computation partitions and data distributions, but its rule for adding dependence constraints is too strict and limits the parallelism that can be exploited. In addition, loop nests that do not satisfy the affine condition are not parallelized by affine decomposition. However, if the irregular access is caused only by an indirect array, the loop and array references can still be partitioned at compile time. To address these problems, this paper proposes an improved static decomposition algorithm for DMPCs. Experimental results show that the algorithm can improve the performance of parallel programs.
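
The point about false dependence can be illustrated with a minimal Python sketch. This is not the paper's algorithm; the function names, the block partitioning, and the one-element halo are assumptions made purely for illustration. The loop `a[i] = a[i+1] + 1` carries an anti-dependence (a false dependence): iteration `i` reads a value that iteration `i+1` later overwrites. When each simulated node works on a private copy of its block, as it would in its own local memory, the blocks can be computed independently and still match the sequential result.

```python
def sequential(a):
    """Reference loop: iteration i reads a[i+1] before iteration i+1 overwrites it."""
    out = list(a)
    for i in range(len(out) - 1):
        out[i] = out[i + 1] + 1
    return out


def block_partitioned(a, num_nodes):
    """Simulate a block partition of the loop across num_nodes private memories.

    Hypothetical illustration: each 'node' copies its block plus one halo
    element from the original array, so no node ever sees another node's writes.
    """
    n_iters = len(a) - 1
    chunk = (n_iters + num_nodes - 1) // num_nodes  # ceiling division
    result = list(a)
    for node in range(num_nodes):
        lo = node * chunk
        hi = min(lo + chunk, n_iters)
        if lo >= hi:
            continue  # this node received no iterations
        local = a[lo:hi + 1]  # private copy: block plus one halo element
        for k in range(hi - lo):
            local[k] = local[k + 1] + 1
        result[lo:hi] = local[:hi - lo]  # gather the node's block
    return result


data = [3, 1, 4, 1, 5, 9, 2, 6]
same = block_partitioned(data, 3) == sequential(data)
```

Here the nodes are processed one after another only for simplicity; since each works on its own copy, the order in which they run does not affect the result.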


Similar Papers

An Automatic Computation and Data Decomposition Algorithm of Prioritized Dominant Array
Rui Ding ... Lin Han
01 Dec 2012

Influence of regular system interrupts on performance of parallel stencil computations
Programming and Computer Software | VOL. 40
01 Sep 2014

Automatic data and computation decomposition on distributed memory parallel computers
Peizong Lee ... Zvi Meir Kedem
ACM Transactions on Programming Languages and Systems | VOL. 24
01 Jan 2002

Communication Benchmarking and Performance Modelling of MPI Programs on Cluster Computers
D A Grove ... P D Coddington
The Journal of Supercomputing | VOL. 34
01 Nov 2005
