Application aware Scalable Architecture for GPGPU

Winnie Thomas,Rohin D Daruwala

doi:10.1016/j.sysarc.2018.07.003

Abstract

Modern General Purpose Graphic Processing Units (GPGPU) offer high throughput for parallel applications with their hundreds of integrated cores. However, there are applications that experience performance saturation and even degradation with increasing number of cores. At present the scheduler in the GPU hardware allocates all the available resources to maximize their utilization. We observed that applications have preference towards specific set of resources. The utilization of other redundant resources can reduce the throughput of the applications. To overcome this problem, in this paper we first classify the applications into two types; type-I that dominantly require processing cores and type-II that rely on the performance of the memory-system. We propose an Application aware Scalable Architecture (ApSA) for GPGPU based on classified applications which performs run-time tailoring of the GPU resources to present an optimal set of resources to the running application. The results are analyzed and compared in terms of instructions per cycle, bandwidth utilization and branch divergence. We found that if the application is identified to be of type-I with the proposed technique the average profiling overhead is 1.6%. Type-II applications experience average profiling overhead of 1.15%. The average power saved by clock-gating redundant resources in the case of type-II applications is 20.08%.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Application aware Scalable Architecture for GPGPU

Abstract

Talk to us

Similar Papers

More From: Journal of Systems Architecture

Lead the way for us

Journal: Journal of Systems Architecture	Publication Date: Jul 26, 2018
Citations: 3

Similar Papers

Modeling and characterizing GPGPU reliability in the presence of soft errors
Jingweijia Tan ... Yang Yi
Parallel Computing | VOL. 39
Jingweijia Tan, et. al.Jingweijia Tan ... Yang Yi
26 Jan 2013
Parallel Computing | VOL. 39

Improving branch divergence performance on GPGPU with a new PDOM stack and multi-level warp scheduling
Licheng Yu ... Xingsheng Tang
Journal of Systems Architecture | VOL. 60
Licheng Yu, et. al.Licheng Yu ... Xingsheng Tang
27 Nov 2013
Journal of Systems Architecture | VOL. 60

RISE
Jingweijia Tan ... Xin Fu
-
Jingweijia Tan, et. al.Jingweijia Tan ... Xin Fu
19 Sep 2012
19 Sep 2012

A Survey of GPGPU Parallel Processing Architecture Performance Optimization
Shiwei Jia ... Ze Tian
-
Shiwei Jia, et. al.Shiwei Jia ... Ze Tian
13 Oct 2021
13 Oct 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Application aware Scalable Architecture for GPGPU

Abstract

Talk to us

Similar Papers

More From: Journal of Systems Architecture