Abstract

As a classic and well-performing deep convolutional neural network, DenseNet connects every layer to all of its preceding layers via skip connections. However, this dense connectivity introduces considerable redundancy and consumes substantial computational resources. In this paper, to automatically prune redundant skip connections in DenseNet, we introduce a novel reinforcement learning method called automatic DenseNet sparsification (ADS). In ADS, we use an adjacency matrix to represent the dense connections in DenseNet and design an agent based on recurrent neural networks (RNNs) to sparsify the matrix, i.e., to remove redundant skip connections. The validation accuracies of the sparsified DenseNets are used as rewards to update the agent, which encourages it to generate sparsified DenseNets with high performance. Extensive experiments demonstrate the effectiveness of ADS: the sparsified DenseNet surpasses not only the original DenseNet but also related models, and it transfers well when applied to new tasks. More importantly, ADS is very efficient: compressing a 40-layer DenseNet takes less than one day on a single GPU.
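To make the connection representation concrete, here is a minimal sketch (not the authors' code) of how a binary, strictly lower-triangular adjacency matrix can encode the skip connections of a dense block, and how an agent-produced mask sparsifies it. The block size, the random stand-in mask, and the `layer_inputs` helper are illustrative assumptions.

```python
import numpy as np

# A dense block with L layers. A[i, j] = 1 (for j < i) means the output of
# layer j is fed into layer i via a skip connection. The original DenseNet
# corresponds to a fully dense, strictly lower-triangular matrix.
L = 5
A_dense = np.tril(np.ones((L, L), dtype=int), k=-1)

# The ADS agent would emit a binary mask of the same shape; zeroing an entry
# removes the corresponding skip connection. A random mask stands in here.
rng = np.random.default_rng(0)
mask = np.tril((rng.random((L, L)) > 0.5).astype(int), k=-1)
A_sparse = A_dense * mask

def layer_inputs(A, i):
    """Indices of the preceding layers whose outputs layer i concatenates."""
    return np.nonzero(A[i])[0].tolist()

print("layer 4 reads from layers:", layer_inputs(A_sparse, 4))
```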

Highlights

  • In recent years, convolutional neural networks (CNNs) have been widely applied in many pattern recognition and computer vision tasks, such as image classification, object tracking, and image super-resolution [1]–[4].

  • Experimental results obtained by automatic DenseNet sparsification (ADS) are reported on the CIFAR-10 dataset, which consists of 60,000 images belonging to 10 classes.

  • In this paper, we present a novel method called automatic DenseNet sparsification (ADS) to prune unimportant skip connections in DenseNet based on reinforcement learning (a minimal policy-gradient sketch follows this list).
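The highlights describe the reinforcement learning loop only at a high level. Below is a hedged PyTorch sketch of how an RNN agent could emit keep/drop decisions for skip connections and be updated with REINFORCE, using validation accuracy as the reward. The `MaskAgent` class, the `evaluate_sparsified_densenet` stub, and the constant baseline are hypothetical placeholders, not the paper's implementation.

```python
import torch
import torch.nn as nn

class MaskAgent(nn.Module):
    """Hypothetical RNN policy: one Bernoulli keep/drop decision per skip connection."""
    def __init__(self, hidden=64):
        super().__init__()
        self.hidden = hidden
        self.rnn = nn.GRUCell(1, hidden)
        self.head = nn.Linear(hidden, 1)

    def sample(self, n_connections):
        h = torch.zeros(1, self.hidden)
        x = torch.zeros(1, 1)
        log_probs, decisions = [], []
        for _ in range(n_connections):
            h = self.rnn(x, h)
            p = torch.sigmoid(self.head(h))          # keep probability
            d = torch.bernoulli(p)                   # sampled keep/drop decision
            log_probs.append(torch.log(d * p + (1 - d) * (1 - p) + 1e-8))
            decisions.append(int(d.item()))
            x = d.detach()                           # feed the decision back as input
        return decisions, torch.stack(log_probs).sum()

def evaluate_sparsified_densenet(decisions):
    # Placeholder: build the sparsified DenseNet from the decisions, inherit
    # the parent's weights, and return its validation accuracy. Dummy value here.
    return sum(decisions) / max(len(decisions), 1)

agent = MaskAgent()
optimizer = torch.optim.Adam(agent.parameters(), lr=1e-3)
baseline = 0.0  # e.g., a moving average of past rewards

decisions, log_prob = agent.sample(n_connections=15)
reward = evaluate_sparsified_densenet(decisions)
loss = -(reward - baseline) * log_prob  # REINFORCE with a baseline
optimizer.zero_grad()
loss.backward()
optimizer.step()
```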


Summary

INTRODUCTION

Convolutional neural networks (CNNs) are widely applied in many pattern recognition and computer vision tasks, such as image classification, object tracking, and image super-resolution [1]–[4]. Following DenseNet, a number of deep learning approaches using dense connections have been proposed. Compared with previous reinforcement learning algorithms, which validate the generated deep models by training them from scratch and thus require huge computational resources (e.g., 10⁴–10⁵ GPU hours) [14], [15], ADS is very efficient because it employs a weight inheritance technique: compressing a 40-layer DenseNet takes less than one day on a single GPU. The contributions of this paper can be summarized as follows.
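The summary credits ADS's efficiency to weight inheritance but gives no implementation details. A plausible PyTorch sketch under that assumption follows, where `inherit_weights` is a hypothetical helper that copies every parent parameter whose name and shape survive in the sparsified child, so the child can be evaluated (or briefly fine-tuned) instead of being trained from scratch.

```python
import torch.nn as nn

def inherit_weights(parent: nn.Module, child: nn.Module) -> None:
    """Copy every parameter/buffer of the parent DenseNet whose name and
    shape also appear in the sparsified child network."""
    parent_state = parent.state_dict()
    child_state = child.state_dict()
    for name, tensor in child_state.items():
        # Pruned skip connections change the input width of some layers,
        # so only shape-compatible tensors are inherited.
        if name in parent_state and parent_state[name].shape == tensor.shape:
            child_state[name] = parent_state[name].clone()
    child.load_state_dict(child_state)
```

The shape check is the key design point: removing a skip connection shrinks the concatenated input of downstream layers, so those layers' weights cannot be copied verbatim and must be re-initialized or sliced, while all unaffected layers keep their trained weights.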

DenseNet
OPTIMIZATION OF THE RL AGENT
CONCLUSION
