An adaptive offline implementation selector for heterogeneous parallel platforms

David Del Rio Astorga,Manuel F Dolz,Luis Miguel Sánchez,Javier Fernández,J Daniel García

doi:10.1177/1094342017698746

Abstract

Heterogeneous parallel platforms, comprising multiple processing units and architectures, have become a cornerstone in improving the overall performance and energy efficiency of scientific and engineering applications. Nevertheless, taking full advantage of their resources comes along with a variety of difficulties: developers require technical expertise in using different parallel programming frameworks and previous knowledge about the algorithms used underneath by the application. To alleviate this burden, we present an adaptive offline implementation selector that allows users to better exploit resources provided by heterogeneous platforms. Specifically, this framework selects, at compile time, the tuple device-implementation that delivers the best performance on a given platform. The user interface of the framework leverages two C++ language features: attributes and concepts. To evaluate the benefits of this framework, we analyse the global performance and convergence of the selector using two different use cases. The experimental results demonstrate that the proposed framework allows users enhancing performance while minimizing efforts to tune applications targeted to heterogeneous platforms. Furthermore, we also demonstrate that our framework delivers comparable performance figures with respect to other approaches.

Highlights

In recent years, heterogeneous parallel architectures have provided a way to improve performance and energy efficiency better than other alternatives
This section gives a brief overview about the two C++ language features used for developing the implementation selector interface: C++ attributes and concepts
We evaluate the adaptability of the selector to make appropriate decisions when a new device is attached to the heterogeneous platform each 100 training iterations

Summary

Introduction

Heterogeneous parallel architectures have provided a way to improve performance and energy efficiency better than other alternatives. Platforms comprising diverse devices (such as multi-cores, GPUs, DSPs and FPGAs) are notoriously more difficult to program effectively, since they demand for distinct frameworks and application programming interfaces [5] This fact, has led to multiple implementations of the same algorithm but targeted to different devices. In order to improve performance, developers need to analyze a priori the target platform and the application, along with its implementation alternatives and available libraries. To achieve this goal, some aspects need to be considered. An alternative to the aforementioned technique is to shift the decision-making task directly at compile time Several proposals leveraging this static approach and based on analytic models, machine learning and adaptive optimization methods can be found in the literature [1].

Related Work

Background

The hardware parallel platform description language

The adaptive offline implementation selector

The attributes-based interface

The selector module

Experimentalevaluation

Evaluation of the accuracy and performance

Evaluation of the adaptability

Comparisonwithalternativeapproaches

Conclusions

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: The International Journal of High Performance Computing Applications	Publication Date: Mar 26, 2017
Citations: 3	License type: cc-by

R Discovery Prime

R Discovery Prime

An adaptive offline implementation selector for heterogeneous parallel platforms

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: The International Journal of High Performance Computing Applications

Lead the way for us

Similar Papers

Special Issue 19th international workshop on algorithms, models and tools for parallel computing on heterogeneous platforms (HeteroPar'21)
Rosa M Badia
Concurrency and computation : practice & experience | VOL. 35
Rosa M BadiaRosa M Badia
19 Oct 2022
Concurrency and computation : practice & experience | VOL. 35

Customization of OpenCL applications for efficient task mapping under heterogeneous platform constraints
...
-
, et. al. ...
09 Mar 2015
09 Mar 2015

Customization of OpenCL Applications for Efficient Task Mapping under Heterogeneous Platform Constraints
Edoardo Paone ... Vittorio Zaccaria
-
Edoardo Paone, et. al.Edoardo Paone ... Vittorio Zaccaria
01 Jan 2015
01 Jan 2015

Implementation of a performance optimized database join operation on FPGA-GPU platforms using OpenCL
Mehdi Roozmeh ... Luciano Lavagno
-
Mehdi Roozmeh, et. al.Mehdi Roozmeh ... Luciano Lavagno
01 Oct 2017
01 Oct 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An adaptive offline implementation selector for heterogeneous parallel platforms

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: The International Journal of High Performance Computing Applications