Abstract

Heterogeneous computing with GPUs integrated on the same chip as CPUs is ubiquitous, and to increase programmability, many of these systems support virtual address accesses from GPU hardware. However, this entails address translation on every memory access. We observe that future GPUs and workloads show very high bandwidth demands (up to 4 accesses per cycle in some cases) for shared address translation hardware due to frequent private TLB misses. This greatly impacts performance (32% average performance degradation relative to an ideal MMU). To mitigate this overhead, we propose a software-agnostic, practical, GPU virtual cache hierarchy. We use the virtual cache hierarchy as an effective address translation bandwidth filter. We observe that many requests that miss in private TLBs find corresponding valid data in the GPU cache hierarchy. With a GPU virtual cache hierarchy, these TLB misses can be filtered (i.e., virtual cache hits), significantly reducing bandwidth demands on the shared address translation hardware. In addition, accelerator-specific attributes of GPUs (e.g., a lower likelihood of synonyms) reduce the design complexity of virtual caches, making a whole virtual cache hierarchy (including a shared L2 cache) practical for GPUs. Our evaluation shows that the entire GPU virtual cache hierarchy effectively filters the high address translation bandwidth, achieving almost the same performance as an ideal MMU. We also evaluate L1-only virtual cache designs and show that using a whole virtual cache hierarchy provides additional performance benefits (1.31× speedup on average).
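
As a rough, hypothetical illustration of the access flow the abstract describes (not the paper's implementation), the following C++ sketch models a request that misses the private TLB being probed first in a virtually tagged cache, so that only virtual-cache misses consume shared address-translation bandwidth. All structures, sizes, and replacement policies below are illustrative assumptions.

    // Minimal sketch (assumed model): a virtually tagged cache hierarchy acts
    // as a bandwidth filter in front of shared address-translation hardware.
    #include <cstddef>
    #include <cstdint>
    #include <deque>
    #include <iostream>
    #include <unordered_set>

    constexpr uint64_t kPageBits   = 12;  // 4 KiB pages (assumed)
    constexpr uint64_t kLineBits   = 6;   // 64 B cache lines (assumed)
    constexpr size_t   kTlbEntries = 4;   // deliberately tiny: models TLB thrashing

    struct Stats {
      uint64_t tlb_hits = 0;
      uint64_t filtered = 0;      // TLB miss that hits in the virtual cache
      uint64_t translations = 0;  // requests that reach the shared MMU
    };

    class GpuCore {
     public:
      void Access(uint64_t vaddr, Stats& s) {
        const uint64_t vpn  = vaddr >> kPageBits;
        const uint64_t line = vaddr >> kLineBits;

        if (tlb_.count(vpn)) {        // private TLB hit: no translation needed
          ++s.tlb_hits;
          vcache_.insert(line);
          return;
        }
        if (vcache_.count(line)) {    // TLB miss filtered: data is found by its
          ++s.filtered;               // virtual address, no shared-MMU traffic
          return;
        }
        ++s.translations;             // only now use the shared translation HW
        InsertTlb(vpn);
        vcache_.insert(line);
      }

     private:
      void InsertTlb(uint64_t vpn) {  // FIFO replacement in the tiny TLB
        if (fifo_.size() == kTlbEntries) {
          tlb_.erase(fifo_.front());
          fifo_.pop_front();
        }
        tlb_.insert(vpn);
        fifo_.push_back(vpn);
      }

      std::unordered_set<uint64_t> tlb_;     // stands in for a small private TLB
      std::deque<uint64_t> fifo_;
      std::unordered_set<uint64_t> vcache_;  // virtually tagged L1/L2 hierarchy
    };

    int main() {
      GpuCore core;
      Stats s;
      // Stream a 16-page working set twice. The 4-entry TLB thrashes on the
      // second pass, but the virtual cache still holds every line, so those
      // misses are filtered instead of consuming translation bandwidth.
      for (int pass = 0; pass < 2; ++pass)
        for (uint64_t addr = 0; addr < 16 * 4096; addr += 64)
          core.Access(addr, s);

      std::cout << "TLB hits: " << s.tlb_hits
                << "  filtered by virtual cache: " << s.filtered
                << "  shared translations: " << s.translations << "\n";
    }

In this toy model the second pass generates many private TLB misses, yet almost all of them are satisfied by the virtually tagged cache and never reach the shared translation hardware, which is the filtering effect the abstract quantifies.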
