Abstract

The productivity achieved when developing applications on high-performance reconfigurable heterogeneous computing (HPRHC) systems is increased by using the Open Computing Language (OpenCL). However, the hardware produced by OpenCL compilers in field-programmable gate arrays (FPGAs) can result in severe performance bottlenecks that are challenging to solve. The problem is compounded by the fact that the generated netlist details are disorganized, making them mostly unreadable and only partially visible to designers. This paper proposes an in-FPGA instrumentation method and a new framework for extracting the FPGA-cycle-accurate timing performances of OpenCL-based designs. The results clearly show that the chosen execution model for OpenCL-based designs strongly affects the timing performance when it is not properly implemented. Our framework is implemented on an HPRHC platform that contains a CPU and two Arria10 FPGAs, and it is evaluated with a wide variety of benchmarks with different complexities. After testing on the reported benchmarks, the average logic overhead for one inserted instrument is 0.2 % of the total amount of adaptive look-up tables (ALUTs) and 0.1 % of the total registers in an FPGA. This resource utilization is between 1.5 and six times lower than those reported in the best previously published works. The scalability of the framework is also evaluated by inserting up to 50 instruments. The experimental results show that the average logic utilization per instrument is 0.19 % of the ALUTs and 0.17 % of the registers in the FPGA when 50 instruments are inserted.

Highlights

  • Field-programmable gate arrays (FPGAs) have progressively evolved as powerful accelerators for high-performance reconfigurable heterogeneous computing (HPRHC) systems

  • It does so by adding embedded instruments to Open Computing Language (OpenCL) kernels. These instruments enable the timing performance analyses of OpenCL designs compiled to FPGA targets

  • This makes our work the first to consider the insertion-based instrumentation restrictions in OpenCL designs compiled for FPGA targets

Read more

Summary

INTRODUCTION

Field-programmable gate arrays (FPGAs) have progressively evolved as powerful accelerators for high-performance reconfigurable heterogeneous computing (HPRHC) systems. To improve debugging and performance analysis capabilities, these instruments allow the observation of relevant values and accurate run-time analyses of any data in any section of the generated FPGA circuit This instrumentation challenge is a known concern with hardware produced by HLS tools [15]. This paper addresses the problem of instrumenting OpenCL designs targeting FPGA-based computing systems and the challenge of portability and scalability with regard to such an instrumentation method It notably presents an instrumentation method and framework that can be used by a software developer and applied to any HPRHC platform using OpenCL.

AND RELATED WORK
OPENCL PLATFORM DESCRIPTION
LIMITATIONS OF EMBEDDED LOGIC ANALYZERS FOR FPGAS
RELATED WORK ON IN-SYSTEM FPGA INSTRUMENTATION
IN-SYSTEM INSTRUMENTATION FRAMEWORK OF OPENCL-BASED FPGA DESIGNS
PROPOSED INSTRUMENTATION METHOD
OVERALL FRAMEWORK
FRAMEWORK CAPABILITIES
PERFORMANCE MODELING AND INSTRUMENTATION
FRAMEWORK IMPLEMENTATION AND RESULTS
EXPERIMENTAL SETUP
IMPLEMENTATION OF THE INSTRUMENTATION LIBRARY
INSTRUMENTATION METHODOLOGY AND RESULTS
EVALUATION OF FRAMEWORK SCALABILITY
FRAMEWORK COMPARISON
CONCLUSION
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.