FPGA‐based HPC accelerators: An evaluation on performance and energy efficiency

Tan Nguyen,Douglas Doerfler,Colin Maclean,Marco Siracusa,Nicholas J Wright,Samuel Williams

doi:10.1002/cpe.6570

Abstract

AbstractHardware specialization is a promising direction for the future of digital computing. Reconfigurable technologies enable hardware specialization with modest non‐recurring engineering cost, but their performance and energy efficiency compared to state‐of‐the‐art processor architectures remain an open question. In this article, we use FPGAs to evaluate the benefits of building specialized hardware for numerical kernels found in scientific applications. In order to properly evaluate performance, we not only compare Intel Arria 10 and Xilinx U280 performance against Intel Xeon, Intel Xeon Phi, and NVIDIA V100 GPUs, but we also extend the Empirical Roofline Toolkit (ERT) to FPGAs in order to assess our results in terms of the Roofline model. We show design optimization and tuning techniques for peak FPGA performance at reasonable hardware usage and power consumption. As FPGA peak performance is known to be far less than that of a GPU, we also benchmark the energy efficiency of each platform for the scientific kernels comparing against microbenchmark and technological limits. Results show that while FPGAs struggle to compete in absolute terms with GPUs on memory‐ and compute‐intensive kernels, they require far less power and can deliver nearly the same energy efficiency.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Concurrency and Computation: Practice and Experience	Publication Date: Aug 22, 2021
Citations: 15	License type: cc-by-nc

R Discovery Prime

R Discovery Prime

FPGA‐based HPC accelerators: An evaluation on performance and energy efficiency

Abstract

Talk to us

Similar Papers

More From: Concurrency and Computation: Practice and Experience

Lead the way for us

Similar Papers

The Performance and Energy Efficiency Potential of FPGAs in Scientific Computing
Tan Nguyen ... Nicholas J Wright
-
Tan Nguyen, et. al.Tan Nguyen ... Nicholas J Wright
01 Nov 2020
01 Nov 2020

On the Mitigation of Cache Hostile Memory Access Patterns on Many-Core CPU Architectures
Tom Deakin ... Simon Mcintosh-Smith
-
Tom Deakin, et. al.Tom Deakin ... Simon Mcintosh-Smith
01 Jan 2017
01 Jan 2017

Efficient Strategies of Compressing Three-Dimensional Sparse Arrays Based on Intel XEON and Intel XEON Phi Environments
Chun-Yuan Lin ... Che-Lun Hung
-
Chun-Yuan Lin, et. al.Chun-Yuan Lin ... Che-Lun Hung
01 Oct 2015
01 Oct 2015

Energy characterization and instruction-level energy model of Intel's Xeon Phi processor
Yakun Sophia Shao ... David Brooks
-
Yakun Sophia Shao, et. al.Yakun Sophia Shao ... David Brooks
01 Sep 2013
01 Sep 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

FPGA‐based HPC accelerators: An evaluation on performance and energy efficiency

Abstract

Talk to us

Similar Papers

More From: Concurrency and Computation: Practice and Experience