TD-NUCA: Runtime Driven Management of NUCA Caches in Task Dataflow Programming Models

Paul Caheny,Miquel Moreto,Lluc Alvarez,Marc Casas

doi:10.1109/sc41404.2022.00085

Paul Caheny, Miquel Moreto + Show 2 more

Open Access

https://doi.org/10.1109/sc41404.2022.00085

Copy DOI

Abstract

In high performance processors, the design of on-chip memory hierarchies is crucial for performance and energy efficiency. Current processors rely on large shared Non-Uniform Cache Architectures (NUCA) to improve performance and reduce data movement. Multiple solutions exploit information available at the microarchitecture level or in the operating system to optimize NUCA performance. However, existing methods have not taken advantage of the information captured by task dataflow programming models to guide the management of NUCA caches. In this paper we propose TD-NUCA, a hardware/software co-designed approach that leverages information present in the run-time system of task dataflow programming models to efficiently manage NUCA caches. TD-NUCA identifies the data access and reuse patterns of parallel applications in the runtime system and guides the operation of the NUCA caches in the hardware. As a result, TD-NUCA achieves a 1.18x average speedup over the baseline S-NUCA while requiring only 0.62x the data movement.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

TD-NUCA: Runtime Driven Management of NUCA Caches in Task Dataflow Programming Models

Abstract

Talk to us

Similar Papers

Lead the way for us

Publication Date: Nov 1, 2022
Citations: 1	License type: other-oa

Similar Papers

An adaptive migration–replication scheme (AMR) for shared cache in chip multiprocessors
Nitin Chaturvedi ... S Gurunarayanan
The Journal of Supercomputing | VOL. 71
Nitin Chaturvedi, et. al.Nitin Chaturvedi ... S Gurunarayanan
26 Jul 2015
The Journal of Supercomputing | VOL. 71

Exploiting Static Non-Uniform Cache Architectures for Hard Real-Time Computing
Yiqiang Ding ... Wei Zhang
Journal of Computing Science and Engineering | VOL. 9
Yiqiang Ding, et. al.Yiqiang Ding ... Wei Zhang
30 Dec 2015
Journal of Computing Science and Engineering | VOL. 9

An Adaptive Block Pinning Cache for Reducing Network Traffic in Multi-core Architectures
Nitin Chaturvedi ... S Gurunarayanan
-
Nitin Chaturvedi, et. al.Nitin Chaturvedi ... S Gurunarayanan
01 Sep 2013
01 Sep 2013

Data Access Type Aware Replacement Policy for Cache Clustering Organization of Chip Multiprocessors
Chongmin Li ... Haixia Wang
-
Chongmin Li, et. al.Chongmin Li ... Haixia Wang
01 Jan 2013
01 Jan 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

TD-NUCA: Runtime Driven Management of NUCA Caches in Task Dataflow Programming Models

Abstract

Talk to us

Similar Papers