MAX2: An ReRAM-Based Neural Network Accelerator That Maximizes Data Reuse and Area Utilization

Manqing Mao, Shimeng Yu, Rui Liu, Jingtao Li, Xiaochen Peng, Chaitali Chakrabarti

doi:10.1109/jetcas.2019.2908937

Abstract

Although recent resistive random access memory (ReRAM)-based accelerator designs for deep convolutional neural networks (CNNs) offer energy-efficiency improvements over CMOS-based accelerators, they incur a large number of energy-consuming data transactions. In this paper, we propose MAX2, a multi-tile ReRAM accelerator framework that supports multiple CNN topologies, maximizes on-chip data reuse, and reduces on-chip bandwidth to minimize the energy consumed by data movement. Building upon the fact that a large filter can be built with a stack of smaller ($3\times 3$) filters, we design every tile with nine processing elements (PEs). Each PE consists of multiple ReRAM subarrays that compute the dot product. The PEs operate in a systolic fashion, thereby maximizing input feature map reuse and minimizing interconnection cost. MAX2 chooses the data size granularity in the systolic array in conjunction with weight duplication to achieve very high area utilization without requiring additional peripheral circuits. We provide a detailed energy and area breakdown of each component at the PE level, tile level, and system level. The system-level evaluation at the 32-nm node on several VGG-network benchmarks shows that MAX2 can improve computation efficiency (TOPs/s/mm$^2$) by $2.5\times$ and energy efficiency (TOPs/s/W) by $5.2\times$ compared with a state-of-the-art ReRAM-based accelerator.
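
The abstract's premise that a large filter can be built from a stack of $3\times 3$ filters can be checked with a little arithmetic. The sketch below is illustrative only and is not taken from the paper: the function names are hypothetical, and it assumes stride-1 convolutions without dilation. It compares the area covered by a stack of $3\times 3$ layers with the weight count of a single filter of the same size, counted per input/output channel pair.

```python
def stacked_receptive_field(num_layers: int, kernel: int = 3) -> int:
    """Side length of the input region seen by a stack of stride-1 convolutions."""
    # Each additional stride-1 layer widens the receptive field by (kernel - 1).
    return num_layers * (kernel - 1) + 1


def stacked_weights(num_layers: int, kernel: int = 3) -> int:
    """Weights in the stack, per input/output channel pair."""
    return num_layers * kernel * kernel


def single_filter_weights(size: int) -> int:
    """Weights in one size-by-size filter, per input/output channel pair."""
    return size * size


for layers in (1, 2, 3):
    size = stacked_receptive_field(layers)
    print(
        f"{layers} stacked 3x3 layer(s) cover {size}x{size}: "
        f"{stacked_weights(layers)} weights vs "
        f"{single_filter_weights(size)} for one large filter"
    )
```

Under these assumptions, two stacked layers cover a $5\times 5$ region and three cover $7\times 7$, in each case with fewer weights than the single large filter. This concerns coverage and weight count only; with nonlinearities between layers, the stack does not compute the same function as one large filter.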


Journal: IEEE Journal on Emerging and Selected Topics in Circuits and Systems
Publication Date: Jun 1, 2019
Citations: 52
License type: publisher-specific, author manuscript


Similar Papers

A Versatile ReRAM-based Accelerator for Convolutional Neural Networks
Manqing Mao ... Shimeng Yu
01 Oct 2018

A Configurable Multi-Precision CNN Computing Framework Based on Single Bit RRAM
Zhenhua Zhu ... Song Han
02 Jun 2019

Study of the Electrical Properties of (Nickel, Titanium, and Tungsten) Oxides and Their Application to Resistive Random Access Memory
01 Jan 2012

Mixed size crossbar based RRAM CNN accelerator with overlapped mapping method
Zhenhua Zhu ... Hanbo Sun
05 Nov 2018
