Exploring memory synchronization and performance considerations for FPGA platform using the high-abstracted OpenCL framework: Benchmarks development and analysis.

Abedalmuhdi Almomany, Amin Jarrah, Muhammed Sutcu

doi:10.1371/journal.pone.0301720

Abstract

A key benefit of the Open Computing Language (OpenCL) software framework is its capability to operate across diverse architectures. Field programmable gate arrays (FPGAs) are a high-speed computing architecture used for computation acceleration. This study investigates the impact of memory access time on overall performance in general FPGA computing environments through the creation of eight benchmarks within the OpenCL framework. The developed benchmarks capture a range of memory access behaviors, and they play a crucial role in assessing the performance of spinning and sleeping on FPGA-based architectures. The results obtained guide the formulation of new implementations and contribute to defining an abstraction of FPGAs. This abstraction is then utilized to create tailored implementations of primitives that are well-suited for this platform. While other research endeavors concentrate on creating benchmarks with the Compute Unified Device Architecture (CUDA) to scrutinize the memory systems across diverse GPU architectures and propose recommendations for future generations of GPU computation platforms, this study delves into the memory system analysis for the broader FPGA computing platform. It achieves this by employing the highly abstracted OpenCL framework, exploring various data workload characteristics, and experimentally delineating the appropriate implementation of primitives that can seamlessly integrate into a design tailored for the FPGA computing platform. Additionally, the results underscore the efficacy of employing a task-parallel model to mitigate the need for high-cost synchronization mechanisms in designs constructed on general FPGA computing platforms.

Journal: PLOS ONE
Publication Date: May 13, 2024
License type: CC BY 4.0

Similar Papers

Algorithms for efficient runtime fault recovery on diverse FPGA architectures
J Lach ... M Potkonjak
01 Nov 1999

Using FPGAs to implement a reconfigurable highly parallel computer
Arne Linde ... Tomas Nordström
01 Jan 1992

Wearable FPGA Platform for Accelerated DSP and AI Applications
Daniel Roggen ... Robert Cobden
21 Mar 2022

HPC Workflow on Diverse XPU Architectures with oneAPI
Mandeep Kumar ... Gagandeep Kaur
24 Jun 2022
