Abstract

Coarse-Grained Reconfigurable Architectures (CGRA) promise both low power and high performance coupled with flexibility, however automatic mapping of applications to such platforms remains a great research challenge. Efficient manual mapping of the data-centric kernels of applications yields great results, however these contain internally control-flow specific tasks, which introduce mapping irregularities and execution inefficiencies on CGRAs. In this paper, we explore analysis, design and synthesis of reconfigurable structures for efficient application-specific control-flow processing, aiming to develop a methodology to design reconfigurable control-flow acceleration modules. Such modules can be coupled with generic CGRAs, off-loading execution of irregular and ill-suited sequential control-flow subroutines, enabling the CGRA to exploit a clean, regular data-flow centric mapping. Considering different architectural paradigms, we design and compare a functional array-based design, a VLIW-style design and an automatically generated design based on graph theoretic concepts against the ASIC implementation of the control flow operations for several kernels of the linear algebra domain. Such reconfigurable control-flow specific accelerators are a first step towards automating CGRA-based accelerator design and application mapping from high-level descriptions.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.