Kernel-Based Resource Allocation for Improving GPU Throughput While Minimizing the Activity Divergence of SMs

Zois-Gerasimos Tasoulas, Iraklis Anagnostopoulos

doi:10.1109/tcsi.2019.2933245

Abstract

Graphics Processing Units (GPUs) have become a major part of modern computing systems. As technology scales down, GPUs integrate more computing elements that accelerate massively parallel applications. As the number of GPU cores grows, sophisticated resource allocation techniques are required to take advantage of the underlying architecture. At the same time, circuit aging is becoming a challenging problem due to shrinking chip dimensions, temperature, and the utilization of GPU resources. Aging increases the switching delay of transistors, resulting in performance degradation, synchronization, and lifetime problems. The problem is more prominent in GPUs because GPU applications differ in behavior and characteristics: they utilize the computing resources differently and consequently cause imbalanced aging. In this paper, we employ a kernel-based resource allocation for optimizing GPU throughput while simultaneously minimizing the activity divergence of Streaming Multiprocessors (SMs). The proposed methodology achieves improved throughput by effectively exploiting the characteristics of the application kernels offloaded to the platform, and it reduces aging divergence among the SMs. Results show that our technique improves GPU throughput by 18% and 13.8% for different GPU micro-architectures, while reducing the aging divergence by up to 89.6% compared to other aging-aware methodologies.
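The abstract does not spell out the allocation algorithm, so the following is only a minimal sketch of the general idea of steering kernels toward the least-used SMs. It is not the authors' method. The `Kernel` fields, the max-minus-min divergence metric, and the greedy placement rule are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Kernel:
    name: str
    sms_needed: int   # hypothetical: how many SMs the kernel is granted
    activity: float   # hypothetical: activity each granted SM accrues


def activity_divergence(sm_activity):
    """Gap between the most and least active SM (one possible metric)."""
    return max(sm_activity) - min(sm_activity)


def allocate(kernels, sm_activity):
    """Greedy sketch: place each kernel on the currently least-active SMs."""
    sm_activity = list(sm_activity)  # work on a copy of the input
    placement = {}
    # Handle the most demanding kernels first so they land on the least-worn SMs.
    for k in sorted(kernels, key=lambda k: k.activity, reverse=True):
        order = sorted(range(len(sm_activity)), key=lambda i: sm_activity[i])
        chosen = order[:k.sms_needed]
        for i in chosen:
            sm_activity[i] += k.activity
        placement[k.name] = sorted(chosen)
    return placement, sm_activity


if __name__ == "__main__":
    history = [5.0, 1.0, 3.0, 0.0]  # made-up accumulated activity per SM
    kernels = [Kernel("a", 2, 4.0), Kernel("b", 2, 1.0)]
    placement, after = allocate(kernels, history)
    print(placement, activity_divergence(history), activity_divergence(after))
```

In this toy model the kernels are placed one after another and throughput is not modeled at all. A real allocator would also have to weigh each kernel's performance characteristics, which the abstract names as the source of the throughput gains.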

Journal: IEEE Transactions on Circuits and Systems I: Regular Papers
Publication Date: Aug 22, 2019
Citations: 34

Similar Papers

Accelerate GPU Concurrent Kernel Execution by Mitigating Memory Pipeline Stalls
Hongwen Dai ... Zhen Lin
01 Feb 2018

To GPU synchronize or not GPU synchronize?
Wu-Chun Feng ... Shucai Xiao
01 May 2010

Automated Architecture-Aware Mapping of Streaming Applications Onto GPUs
Andrei Hagiescu ... Rick Siow Mong Goh
01 May 2011

SmCompactor
Qichen Chen ... Yongseok Son
22 Mar 2021
