Abstract

Convolutional and deep neural networks (CNNs/DNNs) are rapidly growing workloads in emerging AI-based systems. The gap between processing speed and memory-access latency in multi-core systems limits the performance and energy efficiency of CNN/DNN tasks. This article aims to narrow this gap with a simple yet efficient near-memory accelerator-based system that expedites CNN inference. Toward this goal, we first design an efficient parallel algorithm to accelerate CNN/DNN tasks, partitioning the data across multiple memory channels (vaults) to support its parallel execution. Second, we design a hardware unit, the convolutional logic unit (CLU), which implements the parallel algorithm and operates in three phases for layer-wise processing of data to optimize inference. Last, to harness the benefits of near-memory processing (NMP), we integrate homogeneous CLUs on the logic layer of a 3D memory, specifically the Hybrid Memory Cube (HMC). Together, these techniques yield a high-performing and energy-efficient system for CNNs/DNNs. The proposed system achieves substantial performance gains and energy reduction compared with multi-core CPU- and GPU-based systems, with a minimal area overhead of 2.37%.
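To make the vault-level partitioning idea concrete, the following is a minimal C++ sketch, not the paper's implementation: it assumes a direct 2D convolution whose output rows are split across a fixed number of vault-local compute units, with host threads standing in for the per-vault CLUs. Names such as NUM_VAULTS, convolve_partition, and vault_parallel_conv are illustrative assumptions.

```cpp
// Minimal sketch (assumption, not the authors' design): output rows of a direct
// 2D convolution are partitioned across HMC-style vaults, and each partition is
// processed by its own worker, mimicking one CLU per vault on the logic layer.
#include <algorithm>
#include <cstddef>
#include <functional>
#include <thread>
#include <vector>

constexpr int NUM_VAULTS = 8;  // assumption: one compute unit per vault

// Direct (valid) convolution over one horizontal slice [row_begin, row_end) of the output.
void convolve_partition(const std::vector<float>& in, int in_w,
                        const std::vector<float>& k, int k_dim,
                        std::vector<float>& out, int out_w,
                        int row_begin, int row_end) {
    for (int r = row_begin; r < row_end; ++r)
        for (int c = 0; c < out_w; ++c) {
            float acc = 0.0f;
            for (int i = 0; i < k_dim; ++i)
                for (int j = 0; j < k_dim; ++j)
                    acc += in[(r + i) * in_w + (c + j)] * k[i * k_dim + j];
            out[r * out_w + c] = acc;
        }
}

// Partition output rows across vaults and run each partition "near" its data.
std::vector<float> vault_parallel_conv(const std::vector<float>& in, int in_h, int in_w,
                                       const std::vector<float>& k, int k_dim) {
    const int out_h = in_h - k_dim + 1;
    const int out_w = in_w - k_dim + 1;
    std::vector<float> out(static_cast<size_t>(out_h) * out_w, 0.0f);

    std::vector<std::thread> workers;
    const int rows_per_vault = (out_h + NUM_VAULTS - 1) / NUM_VAULTS;
    for (int v = 0; v < NUM_VAULTS; ++v) {
        const int begin = v * rows_per_vault;
        const int end = std::min(out_h, begin + rows_per_vault);
        if (begin >= end) break;
        workers.emplace_back(convolve_partition, std::cref(in), in_w,
                             std::cref(k), k_dim, std::ref(out), out_w, begin, end);
    }
    for (auto& t : workers) t.join();
    return out;
}
```

In the real system the partitions would live in separate vaults and be consumed by CLUs on the HMC logic layer rather than by host threads; the sketch only shows how row-wise data partitioning exposes vault-level parallelism.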
