Locality‐protected cache allocation scheme with low overhead on GPUs

Yang Zhang, Zuocheng Xing, Cang Liu, Chuan Tang

doi:10.1049/iet-cdt.2017.0004

Abstract

Graphics processing units (GPUs) play an increasingly important role in parallel computing. Their multi-threaded execution model lets GPUs accelerate many parallel programmes and save energy. In contrast to their strong computing power, however, GPUs have limited on-chip memory, which is easily exhausted. The throughput-oriented execution model of GPUs introduces thousands of hardware threads, which may access the small cache simultaneously. This causes cache thrashing and contention, which limit GPU performance. Motivated by these issues, the authors put forward a locality-protected method based on the instruction programme counter (LPC) that exploits data locality in the L1 data cache with very low hardware overhead. First, a simple programme counter (PC)-based locality detector collects reuse information for each cache line. Then, a hardware-efficient prioritised cache allocation unit combines this reuse information with time-stamp information to predict the reuse possibility of each cache line and evicts the line with the least reuse possibility. Experiments on a simulator show that LPC provides a speedup of up to 17.8% and an average improvement of 5.0% over the baseline method with very low overhead.
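
The two steps described in the abstract can be illustrated with a small software model. This is only a sketch under stated assumptions, not the authors' hardware design: the abstract does not specify how reuse information is stored or how it is combined with time stamps, so the per-PC saturating counter table (`reuse`), the counter limit (`max_count`), and the victim rule "lowest reuse counter first, oldest time stamp as tie-breaker" are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Line:
    tag: int       # address tag of the cached line
    pc: int        # PC of the load instruction that allocated the line
    last_use: int  # time stamp of the most recent access


class CacheSet:
    """One set of a set-associative cache with PC-aware victim selection.

    Assumption: reuse is tracked in a PC-indexed table of saturating
    counters; the source abstract does not describe the real structure.
    """

    def __init__(self, ways, max_count=3):
        self.ways = ways
        self.max_count = max_count  # hypothetical saturation limit
        self.lines = []
        self.reuse = {}             # hypothetical table: PC -> reuse counter
        self.clock = 0              # logical time used for time stamps

    def access(self, pc, tag):
        """Return True on a hit, False on a miss (allocating the line)."""
        self.clock += 1
        for line in self.lines:
            if line.tag == tag:
                # Locality detector: a hit credits the allocating PC.
                count = self.reuse.get(line.pc, 0) + 1
                self.reuse[line.pc] = min(self.max_count, count)
                line.last_use = self.clock
                return True
        if len(self.lines) == self.ways:
            # Allocation unit: evict the line with the lowest predicted
            # reuse; among equals, evict the least recently used one.
            victim = min(
                self.lines,
                key=lambda l: (self.reuse.get(l.pc, 0), l.last_use),
            )
            self.lines.remove(victim)
        self.lines.append(Line(tag, pc, self.clock))
        return False


# A load at PC 0x10 reuses tag 0xA while a load at PC 0x20 streams
# through tags 0xB, 0xC and 0xD in a 2-way set.
cache = CacheSet(ways=2)
trace = [(0x10, 0xA), (0x10, 0xA), (0x20, 0xB),
         (0x20, 0xC), (0x20, 0xD), (0x10, 0xA)]
hits = [cache.access(pc, tag) for pc, tag in trace]
print(hits)
```

In this trace a plain least-recently-used policy would evict tag 0xA when 0xC arrives, because 0xA has the oldest time stamp. The sketch keeps it because its allocating PC has shown reuse, so the final access to 0xA hits.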

Journal: IET Computers & Digital Techniques
Publication Date: Jan 12, 2018
Citations: 2

Similar Papers

CWLP: coordinated warp scheduling and locality-protected cache allocation on GPUs
Yang Zhang ... Cang Liu
Frontiers of Information Technology & Electronic Engineering | VOL. 19
01 Feb 2018

Prediction-Based Error Correction for GPU Reliability with Low Overhead
Hyunyul Lim ... Sungho Kang
Electronics | VOL. 9
05 Nov 2020

GPU-accelerated CFD Simulations for Turbomachinery Design Optimization
01 Jan 2018

General Purpose Computation on Graphics Processing Units Using OpenCL
01 Jan 2013
