Power optimization through peripheral circuit reusing integrated with loop tiling for RRAM crossbar-based CNN

Yuanhui Ni,Weiwen Chen,Keni Qiu,Wenjuan Cui,Yuanchun Zhou

doi:10.23919/date.2018.8342193

Abstract

Convolutional neural networks (CNNs) have been proposed to be widely adopted to make predictions on a large amount of data in modern embedded systems. Prior studies have shown that convolutional computations which consist of numbers of multiply and accumulate (MAC) operations, serve as the most computationally expensive portion in CNN. Compared to the manner of executing MAC operations in GPU and FPGA, CNN implementation in the RRAM crossbar-based computing system (RCS) demonstrates the outstanding advantages of high performance and low power. However, the current design is energy-unbalanced among the three parts of RRAM crossbar computation, peripheral circuits and memory accesses, the latter two factors can significantly limit the potential gains of RCS. Addressing the problem of high power overhead of peripheral circuits in RCS, the Peripheral Circuit Unit (PeriCU)-Reuse scheme has been proposed to meet given power budget. In this paper, it is further observed that memory accesses can be bypassed if two adjacent layers are assigned in different PeriCUs. In this way, memory accesses can be reduced and thus the performance and power can be improved. A loop tiling technique is proposed to save memory accesses. The experiments of two convolutional applications validate that the proposed loop tiling technique can reduce energy consumption by 61.7%.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Power optimization through peripheral circuit reusing integrated with loop tiling for RRAM crossbar-based CNN

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Low power driven loop tiling for RRAM crossbar-based CNN
Yuanhui Ni ... Keni Qiu
-
Yuanhui Ni, et. al.Yuanhui Ni ... Keni Qiu
09 Apr 2018
09 Apr 2018

A peripheral circuit reuse structure integrated with a retimed data flow for low power RRAM crossbar-based CNN
Keni Qiu ... Zili Shao
-
Keni Qiu, et. al.Keni Qiu ... Zili Shao
01 Mar 2018
01 Mar 2018

Aspects of programming for implementation of convolutional neural networks on multisystem HPC architectures
Sunil Pandey ... Shrish Verma
Journal of Physics: Conference Series | VOL. 2062
Sunil Pandey, et. al.Sunil Pandey ... Shrish Verma
01 Nov 2021
Journal of Physics: Conference Series | VOL. 2062

FPGA-Based Implementation of a CNN Architecture for the On-Board Processing of Very High-Resolution Remote Sensing Images
Romen Neris ... Sebastian Lopez
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | VOL. 15
Romen Neris, et. al.Romen Neris ... Sebastian Lopez
01 Jan 2021
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | VOL. 15

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Power optimization through peripheral circuit reusing integrated with loop tiling for RRAM crossbar-based CNN

Abstract

Talk to us

Similar Papers