Adaptive Cache Management for Energy-Efficient GPU Computing

Xuhao Chen,Zhiying Wang,Jie Lv,Li-Wen Chang,Christopher I Rodrigues,Wen-Mei Hwu

doi:10.1109/micro.2014.11

Abstract

With the SIMT execution model, GPUs can hidememory latency through massive multithreading for many applications that have regular memory access patterns. To support applications with irregular memory access patterns, cache hierarchies have been introduced to GPU architectures to capture temporal and spatial locality and mitigate the effect of irregular accesses. However, GPU caches exhibit poor efficiency due to the mismatch of the throughput-oriented execution model and its cache hierarchy design, which limits system performance and energy-efficiency. The massive amount of memory requests generated by GPU scause cache contention and resource congestion. Existing CPUcache management policies that are designed for multicoresystems, can be suboptimal when directly applied to GPUcaches. We propose a specialized cache management policy for GPGPUs. The cache hierarchy is protected from contention by the bypass policy based on reuse distance. Contention and resource congestion are detected at runtime. To avoid oversaturatingon-chip resources, the bypass policy is coordinated with warp throttling to dynamically control the active number of warps. We also propose a simple predictor to dynamically estimate the optimal number of active warps that can take full advantage of the cache space and on-chip resources. Experimental results show that cache efficiency is significantly improved and on-chip resources are better utilized for cache sensitive benchmarks. This results in a harmonic mean IPCimprovement of 74% and 17% (maximum 661% and 44% IPCimprovement), compared to the baseline GPU architecture and optimal static warp throttling, respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Adaptive Cache Management for Energy-Efficient GPU Computing

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Adaptive Cache Bypass and Insertion for Many-core Accelerators
Xuhao Chen ... Carl Pearson
-
Xuhao Chen, et. al.Xuhao Chen ... Carl Pearson
15 Jun 2014
15 Jun 2014

A Study on L1 Data Cache Bypassing Methods for High-Performance GPUs
Cong Thuan Do ... Jong Myon Kim
-
Cong Thuan Do, et. al.Cong Thuan Do ... Jong Myon Kim
01 Jan 2019
01 Jan 2019

Comparing LLC-Memory Traffic between CPU and GPU Architectures
Mohammad Alaul Haque Monil ... Seyong Lee
-
Mohammad Alaul Haque Monil, et. al.Mohammad Alaul Haque Monil ... Seyong Lee
01 Nov 2021
01 Nov 2021

An Efficient GPU Cache Architecture for Applications with Irregular Memory Access Patterns
Bingchao Li ... Jizeng Wei
ACM Transactions on Architecture and Code Optimization | VOL. 16
Bingchao Li, et. al.Bingchao Li ... Jizeng Wei
17 Jun 2019
ACM Transactions on Architecture and Code Optimization | VOL. 16

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Adaptive Cache Management for Energy-Efficient GPU Computing

Abstract

Talk to us

Similar Papers