Abstract
Deep neural networks (DNNs) contain a large number of weights and usually require many off-chip memory accesses for inference. Compressing the weight size is a major requirement for on-chip-memory-based implementations of DNNs, which not only increases inference speed but also reduces power consumption. We propose a weight compression method for deep neural networks that combines pruning and quantization. The proposed method allows weights to take the values +1 or −1 only at predetermined positions. A look-up table then stores all possible combinations of sub-vectors of the weight matrices, so that structured sparse weights can be encoded and decoded easily with the table. This method not only enables multiplication-free DNN implementations but also compresses the weight storage by as much as 32× compared with floating-point networks, with only a tiny performance loss. Weight distribution normalization and gradual pruning techniques are applied to reduce performance degradation. Experiments are conducted with fully connected DNNs and convolutional neural networks.
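To make the idea concrete, below is a minimal sketch (not the paper's exact implementation) of look-up-table-based encoding and decoding for structured sparse sub-vectors whose non-zero entries are restricted to +1 or −1 at predetermined positions. The sub-vector length `SUBVEC_LEN`, the position set `NONZERO_POS`, and the helper names are assumptions introduced here for illustration only.

```python
# Sketch: LUT encoding/decoding of structured sparse ternary sub-vectors.
# Assumptions: sub-vectors of length SUBVEC_LEN; non-zeros may appear only at
# the predetermined positions NONZERO_POS; each non-zero is +1 or -1.
from itertools import product
import numpy as np

SUBVEC_LEN = 8                 # hypothetical sub-vector length
NONZERO_POS = (1, 4, 6)        # hypothetical predetermined non-zero positions

# Enumerate every allowed sub-vector: one LUT entry per sign combination.
lut = []
for signs in product((-1.0, 1.0), repeat=len(NONZERO_POS)):
    v = np.zeros(SUBVEC_LEN, dtype=np.float32)
    v[list(NONZERO_POS)] = signs
    lut.append(v)
lut = np.stack(lut)            # shape: (2**len(NONZERO_POS), SUBVEC_LEN)

def encode(subvec: np.ndarray) -> int:
    """Map an allowed sub-vector to its LUT index (only a few bits to store)."""
    bits = (subvec[list(NONZERO_POS)] > 0).astype(int)
    return int(sum(b << i for i, b in enumerate(bits[::-1])))

def decode(index: int) -> np.ndarray:
    """Recover the full sub-vector from its stored index."""
    return lut[index]

# Round-trip check on one allowed sub-vector.
w = np.array([0, -1, 0, 0, 1, 0, -1, 0], dtype=np.float32)
assert np.array_equal(decode(encode(w)), w)
```

In this sketch each length-8 sub-vector is stored as a 3-bit index instead of eight 32-bit floats, which illustrates how the combination of structured sparsity and ±1 quantization can yield compression on the order of the 32× figure cited in the abstract; the actual ratio in the paper depends on its chosen sub-vector length and sparsity pattern.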