Abstract
The increasing incorporation of Graphics Processing Units (GPUs) as accelerators has been one of the foremost trends in High Performance Computing (HPC) and provides unprecedented performance; however, the prevalent adoption of the Single-Program Multiple-Data (SPMD) programming model brings with it challenges of resource underutilization. Under SPMD, every CPU process needs GPU capability available to it, but since CPUs generally outnumber GPUs, this asymmetric resource distribution leads to overall underutilization of computing resources. In this paper, we propose efficient GPU sharing under SPMD and formally define a series of GPU sharing scenarios. We provide a performance-modeling analysis for each sharing scenario, validated by accurate experimentation. On this modeling basis, we further conduct experimental studies that explore potential GPU sharing efficiency improvements from multiple perspectives, and present both theoretical and experimental GPU sharing performance analysis and results. Our results demonstrate not only a significant performance gain for SPMD programs with the proposed efficient GPU sharing, but also further improved sharing efficiency from optimization techniques grounded in our accurate modeling.
Highlights
Recent years have seen the proliferation of Graphics Processing Units (GPUs) as application accelerators in High Performance Computing (HPC) systems, driven by rapid advances in graphics processing technology and the introduction of programmable processors in GPUs, known as GPGPU (General-Purpose Computation on Graphics Processing Units) [1]
A series of GPU sharing execution models have been introduced for each of the sharing scenarios, and we provide a theoretical prediction of the attainable performance gain over the non-sharing scenario
Initial performance benchmarking was conducted to validate the accuracy of the proposed sharing-scenario modeling, followed by detailed performance analysis of each sharing scenario using varied benchmark profiles
Summary
Recent years have seen the proliferation of Graphics Processing Units (GPUs) as application accelerators in High Performance Computing (HPC) systems, driven by rapid advances in graphics processing technology and the introduction of programmable processors in GPUs, known as GPGPU (General-Purpose Computation on Graphics Processing Units) [1]. A wide range of HPC systems have incorporated GPUs to accelerate applications, exploiting the unprecedented floating-point performance and massively parallel processor architectures of modern GPUs. From an overall perspective, our proposed approach to GPU sharing is to launch multiple GPU kernels from multiple processes/threads using CUDA streaming execution within a single GPU context; the single-context requirement is met by launching kernels from a single process, as in our virtualization implementation. Multiple optimization perspectives are considered for the different sharing scenarios, ranging from the problem/kernel size and parallelism of the SPMD program to optimizable sharing scenarios. Based on these factors, we provide an experimental optimization analysis and achieve optimized I/O concurrency for kernels under Time Sharing and better Streaming Multiprocessor (SM) utilization.
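The stream-based sharing approach described above can be sketched as follows. This is an illustrative example, not the paper's implementation: the kernel, buffer sizes, and the choice of one stream per sharing SPMD rank are all assumptions. It shows the core mechanism the summary names — several independent kernels submitted on distinct CUDA streams within a single GPU context, leaving the hardware free to overlap or time-share them on the SMs.

```cuda
// Hedged sketch: concurrent kernel submission on CUDA streams in one context.
#include <cstdio>
#include <cuda_runtime.h>

// Trivial per-element kernel standing in for a real SPMD workload.
__global__ void scale(float *data, int n, float factor) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= factor;
}

int main() {
    const int kStreams = 4;      // assumption: one stream per sharing SPMD rank
    const int n = 1 << 20;
    cudaStream_t streams[kStreams];
    float *buf[kStreams];

    for (int s = 0; s < kStreams; ++s) {
        cudaStreamCreate(&streams[s]);
        cudaMalloc(&buf[s], n * sizeof(float));
    }

    // Kernels on distinct streams have no ordering dependency between them,
    // so the device may co-schedule or time-share them across its SMs.
    for (int s = 0; s < kStreams; ++s)
        scale<<<(n + 255) / 256, 256, 0, streams[s]>>>(buf[s], n, 2.0f);

    cudaDeviceSynchronize();     // wait for all streams before cleanup
    for (int s = 0; s < kStreams; ++s) {
        cudaFree(buf[s]);
        cudaStreamDestroy(streams[s]);
    }
    return 0;
}
```

Note that true concurrency additionally requires all kernels to share one GPU context; when the launching entities are separate processes, a single proxy process (as in the paper's virtualization implementation) must issue the launches on their behalf.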