A Practical Highly Paralleled ReRAM-Based DNN Accelerator by Reusing Weight Pattern Repetitions

Yuhao Zhang,Runzhen Xue,Zhiping Jia,Hongchao Du,Zhaoyan Shen,Zili Shao

doi:10.1109/tcad.2021.3071116

Yuhao Zhang, Runzhen Xue + Show 4 more

https://doi.org/10.1109/tcad.2021.3071116

Copy DOI

Export

Save

Cite

Abstract
Full-Text
Similar Papers

Abstract

Listen

Resistive random access memory (ReRAM)-based processing-in-memory (PIM) architecture has been designed to accelerate deep neural networks (DNNs) by concurring computation and memory barriers. To further improve memory and computation efficiency, the weight sparsity characteristic has been explored to optimize the ReRAM-based DNN accelerators. However, these designs only focus on compressing zero weights to eliminate ineffectual computation. In this article, we thoroughly analyze the weight distribution characteristics of several typical DNN models and observe many nonzero weight pattern repetitions (WPRs). Therefore, there is an opportunity to further improve the performance and energy efficiency by reusing these WPR. We propose a novel ReRAM-based accelerator—PattPIM, to achieve space compression and computation reuse by exploring DNN WPR based on practical ReRAM crossbars. In PattPIM, we propose a configurable WPR-aware DNN engine and a WPR-to-OU mapping scheme to save both space and computation resources. An intraprocessing engine (PE) pipeline is designed to improve the parallelism of the computation process. Furthermore, we adopt an approximate weight pattern transform algorithm to improve the DNN WPR ratio to enhance the reuse efficiency with negligible accuracy loss. Our evaluation with 6 DNN models shows that the proposed PattPIM delivers significant performance improvement, ReRAM resource efficiency and energy saving.

Full Text

Published Version

Check institute access

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

A Practical Highly Paralleled ReRAM-Based DNN Accelerator by Reusing Weight Pattern Repetitions

Abstract

Published Version

Talk to us

Similar Papers

More From: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

Lead the way for us

Journal: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems	Publication Date: Apr 5, 2021
Citations: 7

Similar Papers

ReBoc: Accelerating Block-Circulant Neural Networks in ReRAM
Yitu Wang ... Fan Chen
-
Yitu Wang, et. al.Yitu Wang ... Fan Chen
01 Mar 2020
01 Mar 2020

A comparative evaluation of deep convolutional neural network and deep neural network-based land use/land cover classifications of mining regions using fused multi-sensor satellite data
Ajay Kumar ... Amit Kumar Gorai
Advances in Space Research | VOL. 72
Ajay Kumar, et. al.Ajay Kumar ... Amit Kumar Gorai
04 Sep 2023
Advances in Space Research | VOL. 72

ReCom: An efficient resistive accelerator for compressed deep neural networks
Houxiang Ji ... Yiran Chen
-
Houxiang Ji, et. al.Houxiang Ji ... Yiran Chen
01 Mar 2018
01 Mar 2018

Understanding adversarial attack and defense towards deep compressed neural networks
Qi Liu ... Wujie Wen
-
Qi Liu, et. al.Qi Liu ... Wujie Wen
03 May 2018
03 May 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

A Practical Highly Paralleled ReRAM-Based DNN Accelerator by Reusing Weight Pattern Repetitions

Abstract

Published Version

Talk to us

Similar Papers

More From: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems