Early DSE and Automatic Generation of Coarse-grained Merged Accelerators

Iulian Brumar,Georgios Zacharopoulos,Yuan Yao,Saketh Rama,Gu-Yeon Wei,David Brooks

doi:10.1145/3546070

Abstract

Post-Moore’s law area-constrained systems rely on accelerators to deliver performance enhancements. Coarse-grained accelerators can offer substantial domain acceleration, but manual, ad hoc identification of code to accelerate is prohibitively expensive. Because cycle-accurate simulators and high-level synthesis (HLS) flows are so time-consuming, the manual creation of high-utilization accelerators that exploit control and data flow patterns at optimal granularities is rarely successful. To address these challenges, we present AccelMerger, the first automated methodology to create coarse-grained, control- and data-flow-rich merged accelerators. AccelMerger uses sequence alignment matching to recognize similar function call-graphs and loops, and neural networks to quickly evaluate their post-HLS characteristics. It accurately identifies which functions to accelerate, and it merges accelerators to respect an area budget and to accommodate system communication characteristics like latency and bandwidth. Merging two accelerators can save as much as 99% of the area of one. The space saved is used by a globally optimal integer linear program to allocate more accelerators for increased performance. We demonstrate AccelMerger’s effectiveness using HLS flows without any manual effort to fine-tune the resulting designs. On FPGA-based systems, AccelMerger yields application performance improvements of up to 16.7× over software implementations, and 1.91× on average with respect to state-of-the-art early-stage design space exploration tools.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Early DSE and Automatic Generation of Coarse-grained Merged Accelerators

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Embedded Computing Systems

Lead the way for us

Journal: ACM Transactions on Embedded Computing Systems	Publication Date: Jan 24, 2023
Citations: 4

Similar Papers

Acceleration of Trading System Back End with FPGAs Using High-Level Synthesis Flow
Sunil Puranik ... Swapnil Rodi
Electronics | VOL. 12
Sunil Puranik, et. al.Sunil Puranik ... Swapnil Rodi
19 Jan 2023
Electronics | VOL. 12

A High Level Synthesis Flow Using Model Driven Engineering
Sebastien Le Beux ... Jean-Luc Dekeyser
-
Sebastien Le Beux, et. al.Sebastien Le Beux ... Jean-Luc Dekeyser
08 Oct 2010
08 Oct 2010

Investigation of High-Level Synthesis tools’ applicability to data acquisition systems design based on the CMS ECAL Data Concentrator Card example
Michal Husejko ... John Evans
Journal of Physics: Conference Series | VOL. 664
Michal Husejko, et. al.Michal Husejko ... John Evans
01 Dec 2015
Journal of Physics: Conference Series | VOL. 664

Key-Value Store using High Level Synthesis Flow for Securities Trading System
Sunil Puranik ... Rajendra Patrikar
-
Sunil Puranik, et. al.Sunil Puranik ... Rajendra Patrikar
17 Aug 2020
17 Aug 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Early DSE and Automatic Generation of Coarse-grained Merged Accelerators

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Embedded Computing Systems