Optimizing Spatial Mapping of Nested Loop for Coarse-Grained Reconfigurable Architectures

Dajiang Liu,Shaojun Wei,Leibo Liu,Shouyi Yin,Yu Peng

doi:10.1109/tvlsi.2014.2371854

Abstract

Coarse-grained reconfigurable architectures (CGRAs) have drawn increasing attention due to their flexibility and efficiency. Loops in applications are often mapped onto CGRAs for acceleration, and the mapping of loops onto CGRA is quite a challenging work due to the parallel execution paradigm and constrained hardware resource. To map loops onto CGRAs efficiently, it is important to transform loops into pieces that obey hardware resource constraints with less overhead (e.g., communication and configuration overhead). In this paper, we tackle this problem by establishing a performance optimization problem, including loop transformation and back- end placing and routing. A novel searching strategy is also designed to find the optimal result efficiently. Finally, we built a complete flow of mapping loop nests onto CGRA. Experiment results on most kernels of the Polybench show that our proposed approach can improve the performance of the kernels by 42% on average, as compared with the state-of-the-art methods. The runtime complexity of our approach is also acceptable.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Optimizing Spatial Mapping of Nested Loop for Coarse-Grained Reconfigurable Architectures

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Very Large Scale Integration (VLSI) Systems

Lead the way for us

Journal: IEEE Transactions on Very Large Scale Integration (VLSI) Systems	Publication Date: Nov 1, 2015
Citations: 13

Similar Papers

DRMaSV: Enhanced Capability Against Hardware Trojans in Coarse Grained Reconfigurable Architectures
Leibo Liu ... Shaojun Wei
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems | VOL. 37
Leibo Liu, et. al.Leibo Liu ... Shaojun Wei
01 Apr 2018
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems | VOL. 37

Time sharing of Runtime Coarse-Grain Reconfigurable Architectures processing elements in multi-process systems
Benjamin Carrion Schafer
-
Benjamin Carrion SchaferBenjamin Carrion Schafer
01 Dec 2014
01 Dec 2014

Towards Higher Performance and Robust Compilation for CGRA Modulo Scheduling
Zhongyuan Zhao ... Wenzhi Yin
IEEE Transactions on Parallel and Distributed Systems | VOL. 31
Zhongyuan Zhao, et. al.Zhongyuan Zhao ... Wenzhi Yin
01 Sep 2020
IEEE Transactions on Parallel and Distributed Systems | VOL. 31

DyMeP: An Infrastructure to Support Dynamic Memory Binding for Runtime Mapping in CGRAs
Muhammad Adeel Tajammul ... S.M.A Jafri
-
Muhammad Adeel Tajammul, et. al.Muhammad Adeel Tajammul ... S.M.A Jafri
01 Jan 2015
01 Jan 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Optimizing Spatial Mapping of Nested Loop for Coarse-Grained Reconfigurable Architectures

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Very Large Scale Integration (VLSI) Systems