Abstract

The datapath bit-width of hardware accelerators for convolutional neural network (CNN) inference is generally chosen to be wide enough to process future, as-yet-unknown CNNs. Here we introduce the cell division technique, a variant of function-preserving transformations. This technique guarantees that a CNN whose weights are quantized to a fixed-point format of arbitrary bit-width can be transformed into a CNN with narrower weight bit-widths without any accuracy drop (indeed, without any accuracy change at all). As a result, CNN hardware accelerators are freed from the weight bit-width constraint that has prevented them from adopting narrower datapaths. In addition, CNNs whose weight bit-widths are wider than those assumed by a given accelerator can still be executed on that accelerator. Experimental results on LeNet-300-100, LeNet-5, AlexNet, and VGG-16 show that weights can be reduced down to 2--5 bits with a 2.5X--5.2X decrease in weight storage requirement and, as guaranteed, without any accuracy drop.
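The abstract does not spell out the mechanism of cell division, but since it is described as a variant of function-preserving transformations, the general idea can be illustrated with a Net2Net-style unit split: a "cell" (neuron or filter) whose outgoing weight is too wide is duplicated, and the wide weight is divided between the two copies so the next layer computes the same sum with narrower operands. The following NumPy sketch is a minimal illustration under that assumption; the toy two-layer network, the `divide_unit` helper, and the half/remainder splitting scheme are all hypothetical choices for exposition, not the paper's exact algorithm.

```python
import numpy as np

# Sketch (assumed mechanism): function-preserving split of one hidden
# unit in a two-layer MLP. A wide outgoing weight w is divided as
# w = w1 + w2, each part requiring fewer bits; the unit is duplicated
# so the next layer computes w1*y + w2*y = w*y, leaving the network's
# output exactly unchanged.

rng = np.random.default_rng(0)

# Toy network: x -> ReLU(W1 @ x) -> W2 @ h, with small integer weights
# standing in for fixed-point values.
W1 = rng.integers(-8, 8, size=(4, 3)).astype(np.float64)  # 4 hidden units
W2 = rng.integers(-8, 8, size=(2, 4)).astype(np.float64)  # 2 outputs

def forward(x, W1, W2):
    return W2 @ np.maximum(W1 @ x, 0.0)

def divide_unit(W1, W2, unit):
    """Duplicate hidden unit `unit` (copying its incoming weights) and
    split its outgoing column of W2 into a half part and a remainder,
    shrinking the magnitude each copy must represent."""
    W1_new = np.vstack([W1, W1[unit:unit + 1]])  # clone incoming weights
    col = W2[:, unit]
    half = np.trunc(col / 2.0)                   # one narrower part
    rest = col - half                            # the remainder
    W2_new = np.hstack([W2, rest[:, None]])      # new column for the clone
    W2_new[:, unit] = half
    return W1_new, W2_new

x = rng.standard_normal(3)
y_before = forward(x, W1, W2)
W1b, W2b = divide_unit(W1, W2, unit=0)
y_after = forward(x, W1b, W2b)

# The transformation preserves the function exactly, matching the
# "no accuracy change" guarantee claimed for cell division.
assert np.allclose(y_before, y_after)

# The divided unit's outgoing weights now span a smaller range, i.e.
# they fit in fewer fixed-point bits.
print(np.abs(W2b[:, [0, -1]]).max(), "<=", np.abs(W2[:, 0]).max())
```

Because the split acts on outgoing weights after the nonlinearity, it preserves the output regardless of the activation function; applying it repeatedly halves the required weight range each time, which is consistent with the abstract's claim that arbitrary bit-widths can be reduced to any narrower target at the cost of extra cells.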
