Parameterized tiled loops for free

Lakshminarayanan Renganarayanan,Sanjay Rajopadhye,Michelle Mills Strout,Daegon Kim

doi:10.1145/1250734.1250780

Abstract

Parameterized tiled loops-where the tile sizes are not fixed at compile time, but remain symbolic parameters until later--are quite useful for iterative compilers and auto-tuners that produce highly optimized libraries and codes. Tile size parameterization could also enable optimizations such as register tiling to become dynamic optimizations. Although it is easy to generate such loops for (hyper) rectangular iteration spaces tiled with (hyper) rectangular tiles, many important computations do not fall into this restricted domain. Parameterized tile code generation for the general case of convex iteration spaces being tiled by (hyper) rectangular tiles has in the past been solved with bounding box approaches or symbolic Fourier Motzkin approaches. However, both approaches have less than ideal code generation efficiency and resulting code quality. We present the theoretical foundations, implementation, and experimental validation of a simple, unified technique for generating parameterized tiled code. Our code generation efficiency is comparable to all existing code generation techniques including those for fixed tile sizes, and the resulting code is as efficient as, if not more than, all previous techniques. Thus the technique provides parameterized tiled loops for free! Our one-size-fits-all solution, which is available as open source software can be adapted for use in production compilers.

Highlights

The algorithm implemented on a parallel computer with distributed memory has, as a rule, a tiled structure: a set of operations is divided into subsets, called tiles
One of the modern approaches to obtaining tiled versions of algorithms is a tiling transformation based on information sections of the iteration space, resulting in macro-operations
Compared to one-dimensional structures, the use of two-dimensional structures is possible in a smaller number of cases, but it can have advantages when implementing algorithms on parallel computers with distributed memory

Summary

ПОСТРОЕНИЕ ДВУМЕРНЫХ ЗЕРНИСТЫХ ПАРАЛЛЕЛЬНЫХ ВЫЧИСЛИТЕЛЬНЫХ ПРОЦЕССОВ

Реализуемый на параллельном компьютере с распределенной памятью, имеет, как правило, зернистую структуру: множество операций разбито на подмножества, называемые зернами вычислений. В настоящей работе для алгоритмов, заданных вложенными многомерными циклами, предложен способ построения зернистых вычислительных процессов, логически организованных в двумерную структуру. По сравнению с одномерными структурами, использование двумерных структур возможно в меньшем числе случаев, но может иметь преимущества при реализации алгоритмов на параллельных компьютерах с распределенной памятью. Представленные исследования обобщают на случай двумерной структуры некоторые аспекты метода построения параллельных вычислительных процессов, организованных в одномерную структуру. В более ранних теоретических исследованиях параллельность зернистых вычислений гарантировалась при наличии информационных разрезов по всем координатам итерационного пространства, а для более простого случая одномерной структуры – по двум координатам. Ключевые слова: параллельные вычисления, распараллеливание алгоритмов, параллельный компьютер с распределенной памятью, уменьшение числа обменов данными.

Набор операторов enddo enddo enddo enddo

Pξ d gl τ

Qξ Pξ

2TPηPξ Nz

Список использованных источников

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Parameterized tiled loops for free

Abstract

Highlights

Summary

Talk to us

Similar Papers

Lead the way for us

Publication Date: Jun 10, 2007
Citations: 99	License type: cc-by

Similar Papers

Parameterized tiled loops for free
Lakshminarayanan Renganarayanan ... Daegon Kim
ACM SIGPLAN Notices | VOL. 42
Lakshminarayanan Renganarayanan, et. al.Lakshminarayanan Renganarayanan ... Daegon Kim
10 Jun 2007
ACM SIGPLAN Notices | VOL. 42

Parameterized loop tiling
Lakshminarayanan Renganarayanan ... Michelle Mills Strout
ACM Transactions on Programming Languages and Systems | VOL. 34
Lakshminarayanan Renganarayanan, et. al.Lakshminarayanan Renganarayanan ... Michelle Mills Strout
01 Apr 2012
ACM Transactions on Programming Languages and Systems | VOL. 34

Tuning iteration space slicing based tiled multi-core code implementing Nussinov\u2019s RNA folding
Marek Palkowski ... Wlodzimierz Bielecki
BMC Bioinformatics | VOL. 19
Marek Palkowski, et. al.Marek Palkowski ... Wlodzimierz Bielecki
15 Jan 2018
Tuning iteration space slicing based tiled multi-core code implementing Nussinov\u2019s RNA folding
Marek Palkowski ... Wlodzimierz Bielecki

Multi-level tiling
Daegon Kim ... Dave Rostron
-
Daegon Kim, et. al.Daegon Kim ... Dave Rostron
10 Nov 2007
10 Nov 2007

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Parameterized tiled loops for free

Abstract

Highlights

Summary

Talk to us

Similar Papers