Abstract
To protect memory values from adversaries with physical access to data centers, secure memory systems ensure memory confidentiality and integrity via memory encryption and verification. The underlying cryptographic computations require a memory block's write counter as input. As such, CPUs today cache counters in the memory controller (MC). Due to the large memory footprint and irregular access patterns of many real-world applications, the MC's counter cache is too small to achieve a high hit rate. A promising solution is to also cache counters in the much larger last-level cache (LLC); accordingly, many prior works use the LLC as a second-level cache for counters to back up the smaller counter cache in the MC. Caching counters in the LLC introduces a new problem, however. Modern server CPUs have a long LLC access latency that can not only diminish the benefit of caching counters in the LLC, but also sometimes significantly increase counter access latency compared to not caching counters in the LLC at all. We observe that the problem lies in the MC sitting behind the LLC: due to its physical location, the MC sees only LLC misses and can therefore access and use counters only serially, after the data miss in the LLC has completed. In contrast, prior designs that do not cache counters in the LLC can access and use counters in parallel with the data access: if a block's counter misses in the MC's counter cache, the MC can fetch the counter from DRAM in parallel with the data; if the counter hits in the MC's counter cache, the MC can use it for the cryptographic computation while the data travel from DRAM to the MC. To parallelize the access and use of counters with the data access while still caching counters in the LLC, we observe that in modern CPUs, L2 is typically the first cache to hold data fetched from DRAM (i.e., L2 and L3 are non-inclusive); consequently, data from DRAM need not be decrypted and verified until they reach L2, making it possible to offload some decryption and verification tasks from the MC to L2. Because L2 sits in front of L3, L2 can fetch the counter and the data from L3 in parallel; L2 can also use the counter for the cryptographic computation while the data travel all the way from DRAM to L2, rather than only from DRAM to the MC. We therefore propose caching and using counters directly in L2, an idea we call Eager Memory Cryptography in Caches (EMCC). Our evaluation shows that, when applied to the state-of-the-art baseline, EMCC improves the performance of large and/or irregular workloads by 7% on average.
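To make the parallelism argument concrete, the sketch below illustrates counter-mode memory encryption, the general mechanism the abstract assumes: a one-time pad is derived from the secret key, the block address, and the block's write counter, and the cache line is simply XORed with that pad. This is an illustrative sketch, not the paper's implementation; all names (one_time_pad, seal, unseal, KEY) are hypothetical, and SHA-256 stands in for the block cipher (typically AES) a real memory controller would use.

```python
import hashlib

# Per-boot secret held inside the CPU package; it never leaves the chip.
KEY = b"example-on-chip-key"
LINE_SIZE = 64  # bytes in one cache line

def one_time_pad(addr: int, counter: int) -> bytes:
    """Pad = PRF(key, block address || write counter).

    Because the pad depends only on the address and the counter, it can
    be computed as soon as the counter is known -- i.e., while the data
    line is still in flight from DRAM.
    """
    msg = KEY + addr.to_bytes(8, "little") + counter.to_bytes(8, "little")
    digest = hashlib.sha256(msg).digest()            # 32 bytes
    return digest + hashlib.sha256(digest).digest()  # stretch to 64 bytes

def seal(line: bytes, addr: int, counter: int) -> bytes:
    """Encrypt one cache line on write-back (the counter is incremented
    before each write-back so the pad is never reused)."""
    pad = one_time_pad(addr, counter)
    return bytes(p ^ k for p, k in zip(line, pad))

# XOR with the same pad both encrypts and decrypts.
unseal = seal

if __name__ == "__main__":
    line = b"A" * LINE_SIZE
    ct = seal(line, addr=0x1000, counter=7)
    assert unseal(ct, addr=0x1000, counter=7) == line
```

Because decryption reduces to a cheap XOR once the pad is ready, the expensive pad generation can overlap the DRAM data fetch whenever the counter is available early. Serializing counter retrieval behind the LLC data miss forfeits exactly this overlap; EMCC restores it by letting L2 fetch and use counters in parallel with the data.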