Accelerating Space Radiative Transfer on FPGA using OpenCL

Norihisa Fujita, Makito Abe, Ryohei Kobayashi, Masayuki Umemura, Yuma Oobata, Kohji Yoshikawa, Taisuke Boku, Yoshiki Yamaguchi

doi:10.1145/3241793.3241799

Abstract

One of the recent challenges in High-Performance Computing (HPC) is how to apply Field-Programmable Gate Array (FPGA) technology to accelerate next-generation supercomputers as an efficient way to achieve high performance with low power consumption. The Graphics Processing Unit (GPU) is the most commonly used accelerator for HPC, but it depends on regularly executed, highly parallel operations, which causes performance bottlenecks in some cases. FPGAs, on the other hand, offer great opportunities to configure logic circuits flexibly and efficiently to fit various computing situations. However, it is not easy for application developers to implement FPGA logic circuits for their applications and algorithms, which generally requires complicated hardware logic descriptions. Thanks to recent progress in FPGA development environments, High-Level Synthesis (HLS) using the OpenCL language has become popular. Based on our experience describing kernels in OpenCL, we found that realizing truly high performance requires a more aggressive programming strategy, based on a codesign concept, that implements the features and operations the target application needs in the FPGA design. In this paper, we optimize the Authentic Radiation Transfer (ART) method on an FPGA using OpenCL. We also discuss a method to parallelize its computation within an FPGA and a method to optimize OpenCL code for FPGAs. Using this codesigned approach to optimized, application-specific programming with OpenCL for an FPGA, we achieved performance 6.9 times faster than a CPU implementation using OpenMP and almost equal to that of a GPU implementation using CUDA. The ART code should also work on a larger configuration with multiple FPGAs, which requires interconnections between them. Considering current advanced FPGAs with interconnection features, we believe that a parallelized implementation across multiple FPGAs will achieve higher performance than a GPU.
