Accelerator Memory Reuse in the Dark Silicon Era

Emilio G Cota,Mario R Casu,Michele Petracca,Paolo Mantovani,Luca P Carloni

doi:10.1109/l-ca.2012.29

Abstract

Accelerators integrated on-die with General-Purpose CPUs (GP-CPUs) can yield significant performance and power improvements. Their extensive use, however, is ultimately limited by their area overhead; due to their high degree of specialization, the opportunity cost of investing die real estate on accelerators can become prohibitive, especially for general-purpose architectures. In this paper we present a novel technique aimed at mitigating this opportunity cost by allowing GP-CPU cores to reuse accelerator memory as a non-uniform cache architecture (NUCA) substrate. On a system with a last level-2 cache of 128kB, our technique achieves on average a 25% performance improvement when reusing four 512 kB accelerator memory blocks to form a level-3 cache. Making these blocks reusable as NUCA slices incurs on average in a 1.89% area overhead with respect to equally-sized ad hoc cache slices.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Computer Architecture Letters	Publication Date: Jan 24, 2014
Citations: 35	License type: other-oa

R Discovery Prime

R Discovery Prime

Accelerator Memory Reuse in the Dark Silicon Era

Abstract

Talk to us

Similar Papers

More From: IEEE Computer Architecture Letters

Lead the way for us

Similar Papers

Interconnect design considerations for large NUCA caches
Naveen Muralimanohar ... Rajeev Balasubramonian
ACM SIGARCH Computer Architecture News | VOL. 35
Naveen Muralimanohar, et. al.Naveen Muralimanohar ... Rajeev Balasubramonian
09 Jun 2007
ACM SIGARCH Computer Architecture News | VOL. 35

Interconnect design considerations for large NUCA caches
Naveen Muralimanohar ... Rajeev Balasubramonian
-
Naveen Muralimanohar, et. al.Naveen Muralimanohar ... Rajeev Balasubramonian
09 Jun 2007
09 Jun 2007

Performance and network power evaluation of tightly mixed SRAM NUCA for 3D Multi-core Network on Chips
Yuang Zhang ... Minglun Gao
-
Yuang Zhang, et. al.Yuang Zhang ... Minglun Gao
01 Jan 2014
01 Jan 2014

A novel migration-based NUCA design for chip multiprocessors
...
-
, et. al. ...
15 Nov 2008
15 Nov 2008

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Accelerator Memory Reuse in the Dark Silicon Era

Abstract

Talk to us

Similar Papers

More From: IEEE Computer Architecture Letters