Abstract

Convolutional neural networks (CNNs) have achieved great success in a broad range of applications. As CNN-based methods are often both computation and memory intensive, sparse CNNs have emerged as an effective way to reduce the amount of computation and memory accesses while maintaining high accuracy. However, dense CNN accelerators can hardly benefit from this reduction of computations and memory accesses because they lack support for irregular, sparse models. This paper proposes a concise convolution rule (CCR) to diminish the gap between sparse CNNs and dense CNN accelerators. CCR transforms a sparse convolution into multiple effective and ineffective convolutions. The ineffective convolutions, in which either the neurons or the synapses are all zeros, do not contribute to the final results, so their computations and memory accesses can be eliminated. The effective convolutions, in which both the neurons and synapses are dense, can be easily mapped onto existing dense CNN accelerators. Unlike prior approaches that trade complexity for flexibility, CCR offers a novel way to reap the benefits of reduced computation and memory accesses while retaining the acceleration of existing dense architectures, without intrusive PE modifications. As a case study, we implemented a sparse CNN accelerator, SparseK, following the rationale of CCR. Experiments show that SparseK achieves a 2.9× speedup on VGG16 compared to a comparably provisioned dense architecture. Compared with state-of-the-art sparse accelerators, SparseK improves performance and energy efficiency by 1.8× and 1.5×, respectively.
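
The abstract does not specify how CCR partitions a sparse convolution, so the following is only a minimal software sketch of the general idea under an assumed input-channel blocking: sub-convolutions whose neuron or synapse block is all zeros are skipped (the "ineffective" case), and the remaining blocks are computed with an ordinary dense routine (the "effective" case). The function name `ccr_conv2d` and the `block` parameter are illustrative assumptions, not the paper's interface.

```python
import numpy as np

def ccr_conv2d(x, w, block=4):
    """Illustrative CCR-style convolution (stride 1, no padding).

    x: input neurons, shape (C, H, W)
    w: sparse synapses, shape (K, C, R, S)
    The kernel is split into blocks along the input-channel axis;
    all-zero blocks are skipped, dense blocks use a plain dense dot product.
    """
    C, H, W = x.shape
    K, _, R, S = w.shape
    out = np.zeros((K, H - R + 1, W - S + 1))
    for c0 in range(0, C, block):
        xb = x[c0:c0 + block]          # neuron block,  shape (<=block, H, W)
        wb = w[:, c0:c0 + block]       # synapse block, shape (K, <=block, R, S)
        if not xb.any() or not wb.any():
            # Ineffective sub-convolution: all zeros, contributes nothing,
            # so its computation and memory accesses are eliminated.
            continue
        # Effective sub-convolution: treated as a regular dense convolution.
        for i in range(out.shape[1]):
            for j in range(out.shape[2]):
                patch = xb[:, i:i + R, j:j + S]
                out[:, i, j] += np.tensordot(wb, patch,
                                             axes=([1, 2, 3], [0, 1, 2]))
    return out

# Example: sparse random input and weights
x = np.random.rand(8, 6, 6) * (np.random.rand(8, 6, 6) > 0.5)
w = np.random.rand(2, 8, 3, 3) * (np.random.rand(2, 8, 3, 3) > 0.7)
y = ccr_conv2d(x, w)   # shape (2, 4, 4)
```

The `any()` test here is simply the software analogue of what the abstract describes in hardware: ineffective sub-convolutions are dropped before they reach the compute units, while effective ones look exactly like the dense workloads an existing accelerator already handles.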
