Decoupling the programming model from resource management in throughput processors

Nandita Vijaykumar ,Gennady Pekhimenko ,Samira Khan ,Ashish Shrestha ,Saugata Ghose ,Adwait Jog ,Kevin Hsieh ,Phillip B Gibbons ,Onur Mutlu

doi:10.1049/pbpc022e_ch4

Abstract

This chapter introduces a new resource virtualization framework, Zorua, that decouples the graphics processing unit (GPU) programming model from the management of key on-chip resources in hardware to enhance programming ease, portability, and performance. The application resource specification-a static specification of several parameters such as the number of threads and the scratchpad memory usage per thread block-forms a critical component of the existing GPU programming models. This specification determines the parallelism, and, hence, performance of the application during execution because the corresponding on-chip hardware resources are allocated and managed purely based on this specification. This tight coupling between the software-provided resource specification and resource management in hardware leads to significant challenges in programming ease, portability, and performance, as we demonstrate in this chapter using real data obtained on state-of-the-art GPU systems. Our goal in this work is to reduce the dependence of performance on the software-provided static resource specification to simultaneously alleviate the above challenges. To this end, we introduce Zorua, a new resource virtualization framework, that decouples the programmer-specified resource usage of a GPU application from the actual allocation in the on-chip hardware resources. Zorua enables this decoupling by virtualizing each resource transparently to the programmer. The virtualization provided by Zorua builds on two key concepts-dynamic allocation of the on-chip resources and their oversubscription using a swap space in memory. Zorua provides a holistic GPU resource virtualization strategy designed to (i) adaptively control the extent of oversubscription and (ii) coordinate the dynamic management of multiple on-chip resources to maximize the effectiveness of virtualization.We demonstrate that by providing the illusion of more resources than physically available via controlled and coordinated virtualization, Zorua offers several important benefits: (i) Programming ease. It eases the burden on the programmer to provide code that is tuned to efficiently utilize the physically available on-chip resources. (ii) Portability. It alleviates the necessity of retuning an application's resource usage when porting the application across GPU generations. (iii) Performance. By dynamically allocating resources and carefully oversubscribing them when necessary, Zorua improves or retains the performance of applications that are already highly tuned to best utilize the resources. The holistic virtualization provided by Zorua has many other potential uses, e.g., fine-grained resource sharing among multiple kernels, low latency preemption of GPU programs, and support for dynamic parallelism, which we describe in this chapter.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Decoupling the programming model from resource management in throughput processors

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Zorua: a holistic approach to resource virtualization in GPUs
...
-
, et. al. ...
15 Oct 2016
15 Oct 2016

Zorua: A holistic approach to resource virtualization in GPUs
Nandita Vijaykumar ... Kevin Hsieh
-
Nandita Vijaykumar, et. al.Nandita Vijaykumar ... Kevin Hsieh
01 Oct 2016
01 Oct 2016

Methodology to Increase the Computational Speed to Obtain the Fractal Dimension Using GPU Programming
Juan Ruiz De Miras ... Jesús Jiménez Ibáñez
-
Juan Ruiz De Miras, et. al.Juan Ruiz De Miras ... Jesús Jiménez Ibáñez
01 Jan 2015
01 Jan 2015

Cache Miss Analysis for GPU Programs Based on Stack Distance Profile
Tao Tang ... Yisong Lin
-
Tao Tang, et. al.Tao Tang ... Yisong Lin
01 Jun 2011
01 Jun 2011

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Decoupling the programming model from resource management in throughput processors

Abstract

Talk to us

Similar Papers