An Architecture for Integrated Near-Data Processors

Erik Vermij,Christoph Hagleitner,Rik Jongerius,Leandro Fiorin,Jan Van Lunteren,Koen Bertels

doi:10.1145/3127069

Abstract

To increase the performance of data-intensive applications, we present an extension to a CPU architecture that enables arbitrary near-data processing capabilities close to the main memory. This is realized by introducing a component attached to the CPU system-bus and a component at the memory side. Together they support hardware-managed coherence and virtual memory support to integrate the near-data processors in a shared-memory environment. We present an implementation of the components, as well as a system-simulator, providing detailed performance estimations. With a variety of synthetic workloads we demonstrate the performance of the memory accesses, the mixed fine- and coarse-grained coherence mechanisms, and the near-data processor communication mechanism. Furthermore, we quantify the inevitable start-up penalty regarding coherence and data writeback, and argue that near-data processing workloads should access data several times to offset this penalty. A case study based on the Graph500 benchmark confirms the small overhead for the proposed coherence mechanisms and shows the ability to outperform a real CPU by a factor of two.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An Architecture for Integrated Near-Data Processors

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Architecture and Code Optimization

Lead the way for us

Journal: ACM Transactions on Architecture and Code Optimization	Publication Date: Sep 6, 2017
Citations: 5

Similar Papers

Practical Mechanisms for Reducing Processor–Memory Data Movement in Modern Workloads

-

21 May 2021
21 May 2021

Near Data Processing and Its Applications
Angelic ... Megha Mahobe
-
Angelic, et. al. Angelic ... Megha Mahobe
01 Jan 2021
01 Jan 2021

FSR: A host-storage collaborative mechanism for data path optimization of NDP operations
Qiao Sun ... Shukan Liu
Journal of Systems Architecture | VOL. 143
Qiao Sun, et. al.Qiao Sun ... Shukan Liu
11 Aug 2023
Journal of Systems Architecture | VOL. 143

Computing with Near Data
Xulong Tang ... Mustafa Karakoy
-
Xulong Tang, et. al.Xulong Tang ... Mustafa Karakoy
20 Jun 2019
20 Jun 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Architecture for Integrated Near-Data Processors

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Architecture and Code Optimization