SASA: A Scalable and Automatic Stencil Acceleration Framework for Optimized Hybrid Spatial and Temporal Parallelism on HBM-based FPGAs

Xingyu Tian,Zhifan Ye,Zhenman Fang,Licheng Guo,Alec Lu,Yuze Chi

doi:10.1145/3572547

Abstract

Stencil computation is one of the fundamental computing patterns in many application domains such as scientific computing and image processing. While there are promising studies that accelerate stencils on FPGAs, there lacks an automated acceleration framework to systematically explore both spatial and temporal parallelisms for iterative stencils that could be either computation-bound or memory-bound. In this article, we present SASA, a scalable and automatic stencil acceleration framework on modern HBM-based FPGAs. SASA takes the high-level stencil DSL and FPGA platform as inputs, automatically exploits the best spatial and temporal parallelism configuration based on our accurate analytical model, and generates the optimized FPGA design with the best parallelism configuration in TAPA high-level synthesis C++ as well as its corresponding host code. Compared to state-of-the-art automatic stencil acceleration framework SODA that only exploits temporal parallelism, SASA achieves an average speedup of 3.41× and up to 15.73× speedup on the HBM-based Xilinx Alveo U280 FPGA board for a wide range of stencil kernels.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

SASA: A Scalable and Automatic Stencil Acceleration Framework for Optimized Hybrid Spatial and Temporal Parallelism on HBM-based FPGAs

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Reconfigurable Technology and Systems

Lead the way for us

Journal: ACM Transactions on Reconfigurable Technology and Systems	Publication Date: Apr 17, 2023
Citations: 3

Similar Papers

Exploiting Spatial and Temporal Parallelism in the Multithreaded Node Architecture Implemented on Superscalar RISC Processors
D J Hwang ... S H Cho
-
D J Hwang, et. al.D J Hwang ... S H Cho
01 Aug 1993
01 Aug 1993

Efficient and Correct Stencil Computation via Pattern Matching and Static Typing
Dominic Orchard ... Alan Mycroft
Electronic Proceedings in Theoretical Computer Science | VOL. 66
Dominic Orchard, et. al.Dominic Orchard ... Alan Mycroft
01 Sep 2011
Electronic Proceedings in Theoretical Computer Science | VOL. 66

Word2Vec FPGA Accelerator Based on Spatial and Temporal Parallelism
Hasitha Muthumala Waidyasooriya ... Masanori Hariyama
-
Hasitha Muthumala Waidyasooriya, et. al.Hasitha Muthumala Waidyasooriya ... Masanori Hariyama
01 Jan 2023
01 Jan 2023

MBSEM image acquisition and image processing in LabView FPGA
Shammi Rahangdale ... P Kruit
-
Shammi Rahangdale, et. al.Shammi Rahangdale ... P Kruit
01 May 2016
01 May 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

SASA: A Scalable and Automatic Stencil Acceleration Framework for Optimized Hybrid Spatial and Temporal Parallelism on HBM-based FPGAs

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Reconfigurable Technology and Systems