Abstract

In recent years, deep neural networks (DNNs) have been widely applied in many areas, such as computer vision and pattern recognition. However, we observe that most DNNs contain redundant layers. Hence, in this paper, we introduce a novel method named incremental layers resection (ILR) to resect the redundant layers in DNNs while preserving their learning performance. ILR uses a multistage learning strategy to incrementally resect inconsequential layers. In each stage, it preserves the data representations learned by the original network while connecting the two layers adjacent to each resected one. In particular, based on a teacher-student knowledge transfer framework, we design layer-level and overall learning procedures that enforce the resected network to perform similarly to the original one. Extensive experiments demonstrate that, compared to the original networks, the networks compressed by ILR need only about half the storage space and offer higher inference speed. More importantly, they even deliver higher classification accuracy than the original networks.
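
To make the core operation concrete, the following PyTorch sketch illustrates what "resecting a layer and connecting its two neighbors" could look like. It is a minimal illustration, not the authors' code: the toy network, the layer indices, and the assumption that adjacent hidden layers share the same width (so no adaptation layer is needed) are all hypothetical.

```python
# Minimal sketch of layer resection: remove a block of layers from a
# sequential network so that the layer before it feeds directly into
# the layer after it. All shapes and indices below are illustrative.
import torch
import torch.nn as nn

teacher = nn.Sequential(
    nn.Linear(128, 128), nn.ReLU(),
    nn.Linear(128, 128), nn.ReLU(),   # suppose this block is judged inconsequential
    nn.Linear(128, 10),
)

def resect(net: nn.Sequential, start: int, end: int) -> nn.Sequential:
    """Return a student network with modules [start, end) removed,
    connecting the module before `start` to the module at `end`."""
    kept = list(net[:start]) + list(net[end:])
    return nn.Sequential(*kept)

student = resect(teacher, 2, 4)        # drop the middle Linear+ReLU pair
x = torch.randn(4, 128)
assert student(x).shape == teacher(x).shape  # same input/output interface
```

In practice the resected network would then be retrained so that its intermediate and final representations match those of the original network; a sketch of that knowledge-transfer step follows the Introduction below.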

Highlights

  • In recent years, deep neural networks (DNNs) have attracted much attention in areas related to artificial intelligence, such as computer vision and pattern recognition

  • Deep convolutional neural networks (CNNs) have achieved great success in image classification and object detection since AlexNet [2] won the ImageNet Large Scale Visual Recognition Challenge (ILSVRC) in 2012

  • 2) Compared with previous knowledge distillation (KD) approaches, where the structures of the teacher and student networks may be quite different, the student network inherits the structure of the teacher network and only inconsequential layers are removed during the incremental layers resection (ILR) learning process


Summary

INTRODUCTION

Deep neural networks (DNNs) have attracted much attention in areas related to artificial intelligence, such as computer vision and pattern recognition. In this paper, we propose a new network compression method called incremental layers resection (ILR) to remove the redundant layers in DNNs. ILR combines the ideas of weight pruning and knowledge distillation (KD): it removes the inconsequential layers and transfers the knowledge of the original network to the compressed one. 1) Compared with previous weight pruning approaches, which mainly remove redundant connections, ILR focuses on layer-level resection. 2) Compared with previous KD approaches, where the structures of the teacher and student networks may be quite different, the student network inherits the structure of the teacher network and only inconsequential layers are removed during the ILR learning process. As the redundant layers are removed step by step, the original network is incrementally compressed without hurting its performance, as sketched below.
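
The sketch below shows one plausible form of the teacher-student knowledge transfer that the overall learning step relies on: the resected (student) network is trained to mimic the original (teacher) network's outputs while still fitting the labels. The temperature, the loss weighting `alpha`, and the function names are assumptions for illustration; the paper's exact layer-level and overall objectives may differ.

```python
# Hedged sketch of a teacher-student retraining step (standard soft-target
# knowledge distillation), used here to illustrate how a resected network
# could be made to behave like the original one. Hyperparameters are assumed.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature: float = 4.0, alpha: float = 0.5):
    """Weighted sum of a softened KD term and the usual cross-entropy."""
    soft_student = F.log_softmax(student_logits / temperature, dim=1)
    soft_teacher = F.softmax(teacher_logits / temperature, dim=1)
    kd = F.kl_div(soft_student, soft_teacher, reduction="batchmean") * temperature ** 2
    ce = F.cross_entropy(student_logits, labels)
    return alpha * kd + (1.0 - alpha) * ce

def overall_learning_step(student, teacher, optimizer, x, labels):
    """One optimization step of the resected network against the frozen teacher."""
    teacher.eval()
    with torch.no_grad():
        teacher_logits = teacher(x)
    loss = distillation_loss(student(x), teacher_logits, labels)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```

In an incremental scheme, such a step would be repeated after each resection stage, so that knowledge is transferred before the next inconsequential layer is removed.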

RELATED WORK
SELECTING INCONSEQUENTIAL LAYERS
OVERALL LEARNING
BLOCKS RESECTION
EXPERIMENTS
CONCLUSION AND DISCUSSION
