Abstract

The demand for multitasking on GPUs increases whenever a GPU is shared by multiple applications, either spatially or temporally. This requires that a GPU can be preempted, switching context to a new application while already executing another. Unlike on CPUs, context switching on GPUs is prohibitively expensive due to the large context state that must be swapped out. There have been a number of efforts to reduce the overhead of preemption, either by shrinking the context size or by overlapping context switching with execution. All of these techniques are reactive: context switching begins only when the preemption request arrives. In this paper, we propose a dynamic and proactive mechanism to reduce preemption latency. We observe that kernel execution is almost always preceded by known commands in both CUDA and OpenCL implementations; hence, a preemption can be anticipated before the actual request arrives. We characterize this lead time and develop a prediction scheme that performs an early state save. When the preemption is actually invoked, an incremental update relative to the previously saved state is performed, much like a conventional checkpointing mechanism. Our design can also choose dynamically and accurately between draining and checkpointing, according to the characteristics of each kernel at runtime. This design reduces the stall time of the preempting kernel due to context switching by 58.6%. Moreover, through careful handling of the saved state, we also reduce the overall size of the saved state by an average of 23.3% compared with a full context switch.
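
To make the proactive scheme concrete, the following is a minimal host-side sketch of the idea as described above: a full state save is performed when a preemption is predicted (a launch-preceding command is observed), and only the state dirtied since then is copied when the request actually arrives, with a drain-versus-checkpoint decision based on estimated remaining execution time. All names here (Context, early_save, incremental_save, should_drain) and the dirty-block granularity are illustrative assumptions for exposition, not the paper's implementation or any real GPU driver API.

```cpp
#include <algorithm>
#include <cstdint>
#include <cstdio>
#include <vector>

constexpr size_t kBlockBytes = 256;  // assumed dirty-tracking granularity

struct Context {
    std::vector<uint8_t> state;  // flattened context image (registers, shared memory, ...)
    std::vector<bool> dirty;     // one dirty bit per kBlockBytes-sized block
};

// Early save, off the critical path: triggered when the predictor sees a
// command that almost always precedes a kernel launch (e.g., a memcpy).
void early_save(Context& ctx, std::vector<uint8_t>& snapshot) {
    snapshot = ctx.state;
    std::fill(ctx.dirty.begin(), ctx.dirty.end(), false);
}

// At the actual preemption request, copy only blocks written since the
// early save -- the incremental step of a checkpointing scheme.
size_t incremental_save(const Context& ctx, std::vector<uint8_t>& snapshot) {
    size_t copied = 0;
    for (size_t b = 0; b < ctx.dirty.size(); ++b) {
        if (!ctx.dirty[b]) continue;
        const size_t off = b * kBlockBytes;
        const size_t len = std::min(kBlockBytes, ctx.state.size() - off);
        std::copy_n(ctx.state.begin() + off, len, snapshot.begin() + off);
        copied += len;
    }
    return copied;  // bytes actually swapped out at preemption time
}

// Dynamic policy: if the kernel is close enough to finishing, draining it
// is cheaper than checkpointing; otherwise checkpoint.
bool should_drain(uint64_t cycles_to_finish, uint64_t cycles_to_save) {
    return cycles_to_finish < cycles_to_save;
}

int main() {
    Context ctx{std::vector<uint8_t>(4096, 0), std::vector<bool>(16, false)};
    std::vector<uint8_t> snapshot;

    early_save(ctx, snapshot);  // predictor fired: save the full context early

    // The kernel keeps running and dirties two blocks before the request lands.
    ctx.state[0] = 1;    ctx.dirty[0] = true;
    ctx.state[3000] = 2; ctx.dirty[3000 / kBlockBytes] = true;

    if (!should_drain(/*cycles_to_finish=*/100000, /*cycles_to_save=*/5000)) {
        const size_t bytes = incremental_save(ctx, snapshot);
        std::printf("preempted: %zu of %zu bytes copied at request time\n",
                    bytes, ctx.state.size());
    }
    return 0;
}
```

Under these assumptions, the latency visible to the preempting kernel is only the incremental copy, which is why an accurate predictor shrinks the stall time so substantially.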
