Abstract

There is growing interest in hardware accelerators that offer better energy efficiency and throughput than GPUs for convolutional neural networks (CNNs). Existing solutions offer relatively limited parallelism and consume substantial power (including leakage power). In this paper, we present a resistive random access memory (ReRAM)-accelerated CNN that achieves significantly higher throughput and energy efficiency when the CNN is trained with binary constraints on both weights and activations and is then mapped onto a digital ReRAM-crossbar. We propose an optimized accelerator architecture tailored for bitwise convolution that features massive parallelism with high energy efficiency. Numerical experiments show that the binary CNN accelerator on a digital ReRAM-crossbar achieves a peak throughput of 792 GOPS at a power consumption of 4.5 mW, which is 1.61 times faster and 296 times more energy-efficient than a high-end GPU.
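To make the bitwise-convolution idea concrete, the following is a minimal Python sketch of the XNOR-and-popcount formulation that binary CNNs with {-1, +1} weights and activations rely on; it illustrates only the arithmetic, not the paper's ReRAM-crossbar mapping or accelerator architecture, and the function names (`binarize`, `bitwise_dot`) are illustrative assumptions rather than names from the paper.

```python
import numpy as np

def binarize(x):
    """Sign-binarize real values to {-1, +1}, the binary constraint
    assumed here for both weights and activations."""
    return np.where(x >= 0, 1, -1).astype(np.int8)

def bitwise_dot(a_bits, w_bits):
    """Dot product of two {-1, +1} vectors computed as XNOR + popcount,
    the bitwise form of convolution. Encoding: +1 -> bit 1, -1 -> bit 0."""
    a = (a_bits > 0)
    w = (w_bits > 0)
    xnor = ~(a ^ w)                      # XNOR: True where the bits agree
    popcount = np.count_nonzero(xnor)    # number of matching positions
    n = a_bits.size
    # matches - mismatches = 2 * popcount - n recovers the {-1,+1} dot product
    return 2 * popcount - n

# Usage: a toy 3x3 convolution patch and kernel
rng = np.random.default_rng(0)
patch = binarize(rng.standard_normal(9))
kernel = binarize(rng.standard_normal(9))
assert bitwise_dot(patch, kernel) == int(patch @ kernel)
```

Because each output reduces to XNOR gates plus a bit count, many such dot products can be evaluated in parallel on a digital crossbar, which is the source of the parallelism and energy efficiency claimed in the abstract.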
