A NUCA substrate for flexible CMP cache sharing

Jaehyuk Huh,Doug Burger,Changkyu Kim,Lixin Zhang,Hazim Shafi,Stephen W Keckler

doi:10.1145/1088149.1088154

Abstract

We propose an organization for the on-chip memory system of a chip multiprocessor, in which 16 processors share a 16MB pool of 256 L2 cache banks. The L2 cache is organized as a non-uniform cache architecture (NUCA) array with a switched network embedded in it for high performance. We show that this organization can support the spectrum of degrees of sharing: unshared, in which each processor has a private portion of the cache, thus reducing hit latency, completely shared, in which every processor shares the entire cache, thus minimizing misses, and every point in between. We find the optimal degree of sharing for a number of cache bank mapping policies, and also evaluate a per-application cache partitioning strategy. We conclude that a static NUCA organization with sharing degrees of two or four work best across a suite of commercial and scientific parallel workloads. We also demonstrate that migratory, dynamic NUCA approaches improve performance significantly for a subset of the workloads at the cost of increased power consumption and complexity, especially as per-application cache partitioning strategies are applied.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A NUCA substrate for flexible CMP cache sharing

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Exploring the relationship between architectures and management policies in the design of NUCA-based chip multicore systems
Sandro Bartolini ... Cosimo Antonio Prete
Future Generation Computer Systems | VOL. 78
Sandro Bartolini, et. al.Sandro Bartolini ... Cosimo Antonio Prete
06 Jul 2017
Future Generation Computer Systems | VOL. 78

A NUCA Substrate for Flexible CMP Cache Sharing
Jaehyuk Huh ... Changkyu Kim
IEEE Transactions on Parallel and Distributed Systems | VOL. 18
Jaehyuk Huh, et. al.Jaehyuk Huh ... Changkyu Kim
01 Aug 2007
IEEE Transactions on Parallel and Distributed Systems | VOL. 18

Supporting faulty banks in NUCA by NoC assisted remapping mechanisms
Kuei-Chung Chang ... Chin-Sheng Yu
The Journal of Supercomputing | VOL. 67
Kuei-Chung Chang, et. al.Kuei-Chung Chang ... Chin-Sheng Yu
25 Aug 2013
The Journal of Supercomputing | VOL. 67

Data Remapping for Static NUCA in Degradable Chip Multiprocessors
Ying Wang ... Yin-He Han
IEEE Transactions on Very Large Scale Integration (VLSI) Systems | VOL. 23
Ying Wang, et. al. Ying Wang ... Yin-He Han
01 May 2015
IEEE Transactions on Very Large Scale Integration (VLSI) Systems | VOL. 23

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A NUCA substrate for flexible CMP cache sharing

Abstract

Talk to us

Similar Papers