Abstract

Heterogeneous systems-on-a-chip are increasingly embracing shared-memory designs, in which a single DRAM is used by both the main CPU and an integrated GPU. This architectural paradigm reduces the overhead of data movement and simplifies programmability. However, deploying real-time workloads on such architectures is troublesome, as memory contention significantly increases the execution time of tasks and the pessimism of worst-case execution time (WCET) estimates. The Predictable Execution Model (PREM) separates memory and computation phases in real-time code, then arbitrates memory phases from different tasks so that only one core at a time accesses the DRAM. This paper revisits the original PREM proposal in the context of heterogeneous SoCs, proposing a compiler-based approach to make GPU code PREM-compliant. Starting from high-level specifications of computation offloading, suitable program regions are selected and separated into memory and compute phases. Our experimental results show that the proposed technique reduces the sensitivity of GPU kernels to memory interference to near zero and achieves up to a 20× reduction in the measured WCET.
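To illustrate the kind of transformation the abstract describes, below is a minimal hand-written sketch of a PREM-style CUDA kernel. This is not the paper's compiler output; the kernel name (saxpy_prem), tile size, and computation are illustrative assumptions. The point is that all DRAM accesses are grouped into explicit memory phases (prefetch into shared memory and write-back), while the compute phase touches only on-chip storage, so a system-level arbiter could schedule the memory phases of different tasks without interference.

```cuda
// Minimal PREM-style sketch (illustrative only, not the paper's generated code).
// DRAM traffic is confined to explicit memory phases; the compute phase
// operates purely on shared memory and registers.
#include <cstdio>
#include <cuda_runtime.h>

#define TILE 256  // assumed block/tile size for this sketch

__global__ void saxpy_prem(const float *in, float *out, float alpha, int n)
{
    __shared__ float tile[TILE];
    int gid = blockIdx.x * blockDim.x + threadIdx.x;

    // ---- memory phase: prefetch this block's tile from DRAM ----
    if (gid < n)
        tile[threadIdx.x] = in[gid];
    __syncthreads();  // memory phase completes before computation starts

    // ---- compute phase: no DRAM accesses ----
    float v = 0.0f;
    if (gid < n)
        v = alpha * tile[threadIdx.x] + 1.0f;
    __syncthreads();

    // ---- write-back memory phase ----
    if (gid < n)
        out[gid] = v;
}

int main()
{
    const int n = 1 << 20;
    float *in, *out;
    // Managed memory mirrors the single shared DRAM of an integrated CPU+GPU SoC.
    cudaMallocManaged(&in, n * sizeof(float));
    cudaMallocManaged(&out, n * sizeof(float));
    for (int i = 0; i < n; ++i) in[i] = 1.0f;

    saxpy_prem<<<(n + TILE - 1) / TILE, TILE>>>(in, out, 2.0f, n);
    cudaDeviceSynchronize();

    printf("out[0] = %f\n", out[0]);  // expect 3.0
    cudaFree(in);
    cudaFree(out);
    return 0;
}
```

In a real PREM deployment, the boundaries between these phases would be the points at which a memory arbiter grants or revokes DRAM access to the GPU; the sketch only shows the code-level separation that makes such arbitration possible.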
