Cache Management Policies Research Articles

The last-level cache (LLC) accounts for the bulk of a modern CPU's transistor budget and is essential for application performance, because it provides fast access to data compared with much slower main memory. However, applications with large working sets often exhibit streaming and/or thrashing access patterns at the LLC. As a result, a large fraction of the LLC capacity is occupied by dead blocks that will not be referenced again, so the capacity is used inefficiently. To improve cache efficiency, state-of-the-art cache management techniques employ prediction mechanisms that learn from past access patterns in order to accurately identify as many dead blocks as possible. Once identified, dead blocks are evicted from the LLC to make space for blocks with potentially high reuse. In this thesis, we identify variability in the reuse behavior of cache blocks as the key factor limiting the cache efficiency of state-of-the-art predictive techniques. Variability in reuse prediction is inevitable because of numerous factors outside the control of the LLC, including control-flow variation, speculative execution, and contention from cores sharing the cache. This variability makes it hard for existing techniques to reliably identify the end of a block's useful lifetime, which lowers prediction accuracy, coverage, or both. To address this challenge, this thesis aims to design LLC management mechanisms and policies that remain robust in the face of variability in reuse prediction and minimize cache misses, while keeping the cost and complexity of the hardware implementation low. To that end, we propose two cache management techniques, one domain-agnostic and one domain-specialized, that improve cache efficiency by addressing variability in reuse prediction.
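The thrashing behavior described in this abstract can be illustrated with a small simulation. The sketch below is not the thesis's mechanism; it assumes a fully associative cache with plain LRU replacement, and the function name `simulate_lru` and the toy traces are hypothetical. It counts how many evicted blocks were never reused after insertion, which is the sense in which the abstract calls a block "dead".

```python
from collections import OrderedDict


def simulate_lru(trace, capacity):
    """Run `trace` (a list of block addresses) through an LRU cache.

    Returns (hits, misses, dead_evictions), where a dead eviction is a
    block that was evicted without a single hit after it was inserted.
    """
    cache = OrderedDict()  # block address -> hits since insertion
    hits = misses = dead_evictions = 0
    for addr in trace:
        if addr in cache:
            hits += 1
            cache[addr] += 1
            cache.move_to_end(addr)  # mark as most recently used
        else:
            misses += 1
            if len(cache) >= capacity:
                # Evict the least recently used block.
                _, reuse_count = cache.popitem(last=False)
                if reuse_count == 0:
                    dead_evictions += 1
            cache[addr] = 0
    return hits, misses, dead_evictions


# Cyclic access over 6 blocks with room for only 4: classic LRU thrashing.
thrashing_trace = [i % 6 for i in range(60)]
print(simulate_lru(thrashing_trace, capacity=4))

# Same cache, but the working set now fits.
fitting_trace = [i % 4 for i in range(60)]
print(simulate_lru(fitting_trace, capacity=4))
```

In the thrashing case every access misses and every evicted block was dead, so the cache contributes nothing; a predictor that identified such blocks early could evict or bypass them and keep part of the working set resident.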


Memory-intensive workloads are becoming increasingly popular on general-purpose graphics processing units (GPGPUs) and impose great challenges on GPGPU memory subsystem design. Meanwhile, with the recent development of non-volatile memory (NVM) technologies, hybrid memory that combines DRAM and NVM achieves high performance, low power, and high density simultaneously, which makes it a promising main memory design for GPGPUs. In this article, we explore shared last-level cache management for GPGPUs with consideration of the underlying hybrid main memory. To improve overall memory subsystem performance, we exploit both the asymmetric read/write latency of the hybrid main memory architecture and the memory coalescing feature of GPGPUs. In particular, to reduce the average cost of L2 cache misses, we prioritize cache blocks from DRAM or NVM based on the observation that operations to the NVM part of main memory have a large impact on system performance. The cache management scheme also integrates GPU memory coalescing and cache bypassing techniques to improve overall system performance. To minimize the impact of memory divergence among simultaneously executed groups of threads, we propose a hybrid-main-memory-aware and warp-aware memory scheduling mechanism for GPGPUs. Experimental results show that, in the context of a hybrid main memory system, our proposed L2 cache management policy and memory scheduling mechanism improve performance by 15.69% on average for memory-intensive benchmarks, with a maximum gain of up to 29%, and achieve an average memory subsystem energy reduction of 21.27%.
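The idea of weighing a block's backing memory during replacement can be sketched as follows. This is only an illustration of cost-aware victim selection in general, not the article's policy: the relative costs in `MISS_COST`, the `Block` fields, and the `window` parameter are all assumptions made for the example.

```python
from dataclasses import dataclass

# Hypothetical relative miss penalties (arbitrary units); not from the article.
MISS_COST = {"DRAM": 1, "NVM": 4}


@dataclass
class Block:
    tag: int
    backing: str   # "DRAM" or "NVM": where the block lives in main memory
    last_use: int  # timestamp of the most recent access


def pick_victim(cache_set, window=2):
    """Choose a victim from one cache set.

    Among the `window` least recently used blocks, evict the one that is
    cheapest to fetch again; ties go to the older block. With window=1
    this reduces to plain LRU.
    """
    candidates = sorted(cache_set, key=lambda b: b.last_use)[:window]
    return min(candidates, key=lambda b: (MISS_COST[b.backing], b.last_use))


cache_set = [
    Block(tag=1, backing="NVM", last_use=0),
    Block(tag=2, backing="DRAM", last_use=1),
    Block(tag=3, backing="DRAM", last_use=5),
    Block(tag=4, backing="NVM", last_use=7),
]
print(pick_victim(cache_set).tag)            # cost-aware choice
print(pick_victim(cache_set, window=1).tag)  # plain LRU choice
```

Here plain LRU would evict the NVM-backed block with tag 1, while the cost-aware variant keeps it and evicts the slightly younger DRAM-backed block with tag 2, trading a little recency for a cheaper future miss.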



Articles published on Cache Management Policies

Finding optimal non-datapath caching strategies via network flow

CASHT: Contention Analysis in Shared Hierarchies with Thefts

SSD internal cache management policies: A survey

Performance Improvement of DAG-Aware Task Scheduling Algorithms with Efficient Cache Management in Spark

Efficient streaming subgraph isomorphism with graph neural networks

Design of the IBM z15 microprocessor

Addressing variability in reuse prediction for last-level caches

Selective bypassing and mapping for heterogeneous applications on GPGPUs

Reuse Distance-based Victim Cache for Effective Utilisation of Hybrid Main Memory System

Integrated Cache Scheduling Replacement Algorithm to Reduce Cache Pollution

ECR: Eviction‐cost‐aware cache management policy for page‐level flash‐based SSDs

A Maximum Cache Value Policy in Hybrid Memory-Based Edge Computing for Mobile Devices

Reducing Writebacks Through In-Cache Displacement

A Novel Adaptive Database Cache Optimization Algorithm Based on Predictive Working Sets in Cloud Environment

ReD: A reuse detector for content selection in exclusive shared last-level caches

A fault-tolerant last level cache for CMPs operating at ultra-low voltage

A Multilevel Cache Management Policy for Performance Improvement in Distributed System

Shared Last-Level Cache Management and Memory Scheduling for GPGPUs with Hybrid Main Memory

RT-CaCC: A Reliable Transport With Cache-Aware Congestion Control Protocol in Wireless Sensor Networks

On Caching and Routing in Information-Centric Networks
