Diamond Tiling: Tiling Techniques to Maximize Parallelism for Stencil Computations

Uday Bondhugula,Vinayaka Bandishti,Irshad Pananilath

doi:10.1109/tpds.2016.2615094

Abstract

Most stencil computations allow tile-wise concurrent start, i.e., there always exists a face of the iteration space and a set of tiling directions such that all tiles along that face can be started concurrently. This provides load balance and maximizes parallelism. However, existing automatic tiling frameworks often choose hyperplanes that lead to pipelined start-up and load imbalance. We address this issue with a new tiling technique, called diamond tiling, that ensures concurrent start-up as well as perfect load-balance whenever possible. We first provide necessary and sufficient conditions for a set of tiling hyperplanes to allow concurrent start for programs with affine data accesses. We then provide an approach to automatically find such hyperplanes. Experimental evaluation on a 12-core Intel Westmere shows that diamond tiled code is able to outperform a tuned domain-specific stencil code generator by 10 to 40 percent, and previous compiler techniques by a factor of 1.3 $\times$ to 10.1 $\times$ .

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Diamond Tiling: Tiling Techniques to Maximize Parallelism for Stencil Computations

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Parallel and Distributed Systems

Lead the way for us

Journal: IEEE Transactions on Parallel and Distributed Systems	Publication Date: May 1, 2017
Citations: 52

Similar Papers

Tiling stencil computations to maximize parallelism
Vinayaka Bandishti ... Uday Bondhugula
-
Vinayaka Bandishti, et. al.Vinayaka Bandishti ... Uday Bondhugula
01 Nov 2012
01 Nov 2012

Tiling stencil computations to maximize parallelism
...
-
, et. al. ...
10 Nov 2012
10 Nov 2012

Parameterized Diamond Tiling for Stencil Computations with Chapel parallel iterators
Ian J Bertolacci ... Ben Harshbarger
-
Ian J Bertolacci, et. al.Ian J Bertolacci ... Ben Harshbarger
08 Jun 2015
08 Jun 2015

Locality aware concurrent start for stencil applications
Sunil Shrestha ... Andres Marquez
-
Sunil Shrestha, et. al.Sunil Shrestha ... Andres Marquez
01 Feb 2015
01 Feb 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Diamond Tiling: Tiling Techniques to Maximize Parallelism for Stencil Computations

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Parallel and Distributed Systems