Abstract

Graphics Processing Units (GPUs), with their massively parallel architecture, have been widely used to boost the performance of both graphics and general-purpose programs, making GPGPUs one of the most attractive platforms for exploiting abundant thread-level parallelism. Recent GPUs employ cache hierarchies to cope with applications that have irregular memory access patterns. Unfortunately, GPU caches exhibit poor efficiency due to performance challenges such as cache contention and resource congestion, which arise from the large number of active threads in GPUs. Cache bypassing can reduce the impact of cache contention and resource congestion. In this paper, we introduce a new cache bypassing technique that makes effective bypassing decisions. In particular, the proposed mechanism employs a small memory, accessible before the actual cache access, to record the tag information of the L1 data cache. Using this information, the mechanism can determine the status of the L1 data cache and use it as a hint to make cache bypassing decisions that are close to optimal. Our experimental results on a modern GPU platform reveal that the proposed cache bypassing technique achieves an IPC improvement of up to 10.4% on average.
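The idea sketched in the abstract can be illustrated with a small simulation: a shadow tag store mirrors the L1 data cache's tags so a bypass decision can be made before the actual cache access. This is a minimal sketch under assumed parameters (128-byte lines, LRU-like replacement, a full-set contention heuristic), not the paper's exact algorithm; all names here are illustrative.

```python
class ShadowTagBypass:
    """Illustrative shadow-tag bypass predictor (assumed heuristic,
    not the paper's exact mechanism)."""

    def __init__(self, num_sets, ways, line_bytes=128):
        self.num_sets = num_sets
        self.ways = ways
        self.line_bytes = line_bytes
        # One small tag list per set, mirroring the L1D tag store.
        self.tags = [[] for _ in range(num_sets)]

    def _index_tag(self, addr):
        line = addr // self.line_bytes
        return line % self.num_sets, line // self.num_sets

    def should_bypass(self, addr):
        """Consult the shadow tags before the real cache access.

        If the tag is already resident, the access is a likely hit and
        should go through the cache. If the tag is absent and the set
        is already full (contended), bypassing avoids evicting data
        that other threads may still reuse.
        """
        s, tag = self._index_tag(addr)
        if tag in self.tags[s]:
            return False                        # predicted hit: use the cache
        return len(self.tags[s]) >= self.ways   # full set: bypass

    def on_fill(self, addr):
        """Keep the shadow tags in sync when the L1D fills a line."""
        s, tag = self._index_tag(addr)
        if tag not in self.tags[s]:
            if len(self.tags[s]) >= self.ways:
                self.tags[s].pop(0)             # mimic LRU-style eviction
            self.tags[s].append(tag)
```

Because the shadow tag store only holds tags, it is much smaller than the cache itself and can be probed early in the pipeline, which is what allows the bypass decision to precede the actual L1 access.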
