Abstract

In this paper, we present a hybrid execution backend for the skeleton programming framework SkePU. The backend is capable of automatically dividing the workload and simultaneously executing the computation on a multi-core CPU and any number of accelerators, such as GPUs. We show how to efficiently partition the workload of skeletons such as Map, MapReduce, and Scan to allow hybrid execution on heterogeneous computer systems. We also show a unified way of predicting how the workload should be partitioned, based on performance modeling. With experiments on typical skeleton instances, we show the speedup for all skeletons when using the new hybrid backend. We also evaluate the performance on real-world applications. Finally, we show that the new implementation gives higher and more reliable performance than an older hybrid execution implementation based on dynamic scheduling.

Highlights

  • The ever-growing demand for higher performance in computing puts requirements on modern programming tools

  • To show the advantages of the static workload partitioning in the new hybrid execution backend, the experimental StarPU integration from SkePU 1 was ported to SkePU 2

  • We have presented a new hybrid execution backend for the skeleton programming framework SkePU, while preserving the existing API


Summary

Introduction

The ever-growing demand for higher performance in computing puts requirements on modern programming tools. We introduce workload partitioning implementations for all data-parallel skeletons in SkePU 2, capable of dividing the work between an arbitrary number of CPU cores and accelerators. The hybrid skeleton implementations use OpenMP: the first thread manages the accelerators, while the remaining threads work on the CPU partition. To reduce code duplication within SkePU, the accelerator partition is computed by the already existing CUDA or OpenCL backend implementations. For reductions, each CPU thread and the accelerator backend reduce their block of the input data to produce a temporary array of partial results, which is then reduced to the final value. A more sophisticated partitioning for matrices would be hard to realize, especially for the MapOverlap skeleton, due to its many corner cases and complex data access patterns.

StarPU backend implementation
Single skeleton evaluation
Generic application evaluation
Comparison to dynamic hybrid scheduling using StarPU
Related work
Findings
Conclusions and future work
