Abstract
As deep models are increasingly deployed in resource-constrained environments, network compression has become an essential part of deep neural network research. In this paper, we identify a mutual relationship between kernel weights, termed Inter-Layer Kernel Correlation (ILKC): the kernel weights of two different convolution layers share substantial similarity in shape and value. Based on this relationship, we propose a new compression method, Inter-Layer Kernel Prediction (ILKP), which represents convolutional kernels with fewer bits by exploiting the similarity between kernel weights in convolutional neural networks. Furthermore, to effectively adapt the inter prediction scheme from video coding technology, we integrate a linear transformation into the prediction scheme, which significantly enhances compression efficiency. The proposed method achieves 93.77% top-1 accuracy at a $4.1\times$ compression ratio compared to the ResNet110 baseline on CIFAR10, i.e., a 0.04% top-1 accuracy improvement with a smaller memory footprint. Moreover, combined with quantization, the proposed method achieves a $13\times$ compression ratio with little performance degradation relative to the ResNet baselines trained on CIFAR10 and CIFAR100.
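To make the prediction idea concrete, below is a minimal sketch of inter-layer kernel prediction with a linear transformation, in the spirit of the scheme described above. It assumes a per-kernel scale-and-offset transform fitted by least squares and uniform quantization of the prediction residual; the function names, the exact form of the transform, and the bit allocation are illustrative assumptions, not the paper's specification.

```python
import numpy as np

def predict_kernel(ref, target, n_bits=4):
    """Predict `target` from reference kernel `ref` via a least-squares
    linear transform (scale a, offset b), then quantize the residual
    to n_bits. Returns the coded representation (a, b, q, step)."""
    r = ref.ravel().astype(np.float64)
    t = target.ravel().astype(np.float64)
    # Fit t ~ a * r + b in the least-squares sense.
    A = np.stack([r, np.ones_like(r)], axis=1)
    (a, b), *_ = np.linalg.lstsq(A, t, rcond=None)
    residual = t - (a * r + b)
    # Uniform symmetric quantization of the residual.
    max_abs = np.abs(residual).max()
    step = max_abs / (2 ** (n_bits - 1) - 1) if max_abs > 0 else 1.0
    q = np.round(residual / step).astype(np.int8)
    return a, b, q, step

def reconstruct_kernel(ref, a, b, q, step, shape):
    """Rebuild the target kernel from its reference and coded residual."""
    return (a * ref.ravel() + b + q * step).reshape(shape)

# Toy example: two correlated 3x3 kernels from different layers.
rng = np.random.default_rng(0)
k_ref = rng.standard_normal((3, 3))
k_tgt = 0.8 * k_ref + 0.1 + 0.05 * rng.standard_normal((3, 3))
a, b, q, step = predict_kernel(k_ref, k_tgt)
k_hat = reconstruct_kernel(k_ref, a, b, q, step, k_tgt.shape)
print("max reconstruction error:", np.abs(k_hat - k_tgt).max())
```

Under this reading, only the transform parameters and a low-bit residual are stored per predicted kernel, which is where the memory saving comes from; the stronger the inter-layer correlation, the smaller the residual and the fewer bits needed.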
Highlights
Deep Neural Networks (DNNs), in particular Convolutional Neural Networks (CNNs), show exceptional performance compared with traditional methods across a wide variety of tasks, such as image classification [1]–[3], object detection [4]–[6], and speech recognition [7], [8].
Based on Inter-Layer Kernel Correlation (ILKC), this paper proposes a simple and effective weight compression method, Inter-Layer Kernel Prediction (ILKP), which shares weights across layers through prediction.
Summary
Deep Neural Networks (DNNs), in particular Convolutional Neural Networks (CNNs), show exceptional performance compared with traditional methods across a wide variety of tasks, such as image classification [1]–[3], object detection [4]–[6], and speech recognition [7], [8]. With this performance improvement, the size of CNN models has increased enormously, and recent works keep growing in parameter count for better performance. To deploy such models in resource-constrained environments, network compression has become essential. Representative methods include pruning [9]–[12], quantization [13]–[15], knowledge distillation [16]–[18], weight sharing [19]–[22], and efficient structural designs, e.g., Depthwise Separable Convolution [23]–[26]. These methods are widely used to compress CNN models. Based on Inter-Layer Kernel Correlation (ILKC), the observation that kernels in different convolution layers are highly similar in shape and value, this paper proposes a simple and effective weight compression method, Inter-Layer Kernel Prediction (ILKP), which shares weights across layers through prediction, as sketched below.
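The summary references ILKC without specifying how the correlation is measured. The sketch below shows one plausible reading, cosine similarity between flattened kernels of two layers, so that each kernel in one layer can be matched to its most similar counterpart in another. The weight layout (out_channels, in_channels, k, k) and the choice of metric are assumptions for illustration, not the paper's definition.

```python
import numpy as np

def kernel_correlation(layer_a, layer_b):
    """Pairwise cosine similarity between every k x k kernel of two conv
    layers. Weights are assumed shaped (out_channels, in_channels, k, k)."""
    ka = layer_a.reshape(-1, layer_a.shape[-2] * layer_a.shape[-1])
    kb = layer_b.reshape(-1, layer_b.shape[-2] * layer_b.shape[-1])
    ka = ka / (np.linalg.norm(ka, axis=1, keepdims=True) + 1e-12)
    kb = kb / (np.linalg.norm(kb, axis=1, keepdims=True) + 1e-12)
    return ka @ kb.T  # shape: [num_kernels_a, num_kernels_b]

# Toy example: match each kernel in layer B to its closest kernel in layer A.
rng = np.random.default_rng(1)
w_a = rng.standard_normal((16, 8, 3, 3))
w_b = rng.standard_normal((16, 8, 3, 3))
sim = kernel_correlation(w_a, w_b)
best_match = sim.argmax(axis=0)  # reference index for each kernel in B
print("mean best-match similarity:", sim.max(axis=0).mean())
```

High values in this similarity matrix would indicate good reference candidates for the prediction step; real trained networks are reported to exhibit far stronger inter-layer similarity than the random weights used in this toy example.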