Abstract

Deep neural networks achieve excellent performance in many research fields, but many of these models are over-parameterized: computing their weight matrices is time-consuming and demands substantial computing resources. To address these problems, this paper proposes a novel block-based division method and a coarse-grained block pruning strategy to simplify and compress the fully connected structure, and stores the pruned weight matrices, which retain a blocky structure, in the Block Sparse Row (BSR) format to accelerate weight-matrix computation. First, the weight matrices are divided into square sub-blocks based on spatial aggregation. Second, a coarse-grained block pruning procedure is applied to scale down the number of model parameters. Finally, the BSR storage format, which is well suited to storing and computing with block sparse matrices, is used to hold the remaining dense weight blocks and speed up the calculation. Experiments on the MNIST and Fashion-MNIST datasets examine how accuracy varies with different pruning granularities and sparsity levels. The results show that the coarse-grained block pruning method compresses the network and reduces the computational cost without greatly degrading classification accuracy, and an experiment on the CIFAR-10 dataset shows that the block pruning strategy also combines well with convolutional networks.
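
The following is a minimal sketch of the pipeline described above (block division, coarse-grained block pruning, BSR storage), written with NumPy and SciPy; the block size, sparsity level, and layer shape are illustrative assumptions, not settings taken from the paper.

```python
# A minimal sketch, assuming NumPy/SciPy: square sub-blocks with the smallest
# L1 norms are zeroed, and the surviving dense blocks are stored and multiplied
# in SciPy's Block Sparse Row (bsr_matrix) format.
import numpy as np
from scipy.sparse import bsr_matrix

def block_prune(weights, block_size, sparsity):
    """Zero out the square sub-blocks with the smallest L1 norms."""
    rows, cols = weights.shape
    assert rows % block_size == 0 and cols % block_size == 0
    # View the matrix as a grid of (block_size x block_size) sub-blocks.
    grid = weights.reshape(rows // block_size, block_size,
                           cols // block_size, block_size).swapaxes(1, 2)
    norms = np.abs(grid).sum(axis=(2, 3))          # one importance score per block
    k = int(norms.size * sparsity)                 # number of blocks to drop
    threshold = np.sort(norms, axis=None)[k]
    mask = (norms >= threshold)[:, :, None, None]  # keep only blocks above the threshold
    return (grid * mask).swapaxes(1, 2).reshape(rows, cols)

# Prune 80% of the 16x16 blocks of a fully connected layer, then store the
# surviving dense blocks in BSR format for the matrix-vector product.
W = np.random.randn(784, 304)                      # illustrative layer shape
W_pruned = block_prune(W, block_size=16, sparsity=0.8)
W_bsr = bsr_matrix(W_pruned, blocksize=(16, 16))
y = W_bsr.dot(np.random.randn(304))                # block sparse matrix-vector product
```

Because each retained block is stored contiguously, the multiplication operates on dense sub-blocks rather than scattered individual elements, which is what makes BSR a natural fit for block-pruned weights.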

Highlights

  • Deep neural network architectures are becoming more complex, and the number of parameters is increasing sharply [1,2]

  • We explored the computational efficiency when our block pruning method was combined with the Block Sparse Row (BSR) format

  • This study presented a special block-based division method and coarse-grained block pruning method for fully connected structures


Summary

Introduction

Deep neural network architectures are becoming more complex, and the number of parameters is increasing sharply [1,2]. In order to reduce the number of parameters and to accelerate computation, many methods for neural network compression and pruning have been proposed, such as low-rank factorization [3], knowledge distillation [4], and weight sharing and connection pruning [5]. On the other hand, existing coarse-grained pruning methods designed for high computing efficiency are mostly specific to convolutional neural networks (CNNs). To address these limitations, this paper focuses on a coarse-grained pruning method suitable for fully connected structures that removes model redundancy and improves computing efficiency without greatly harming accuracy. In [6], Han et al. sorted the absolute values of the weights in the network and deleted connections below a threshold, reducing the number of parameters of LeNet-300-100 by 12 times.
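
As a point of comparison with the block-based approach, a minimal sketch of that fine-grained magnitude pruning (sort the absolute weight values and delete connections below a threshold) might look as follows; the layer shape and sparsity level are illustrative choices, not figures reported in [6].

```python
# A minimal sketch of fine-grained (connection-level) magnitude pruning:
# individual weights whose absolute value falls below a threshold are removed.
import numpy as np

def magnitude_prune(weights, sparsity):
    """Zero out the fraction `sparsity` of weights with the smallest magnitudes."""
    threshold = np.sort(np.abs(weights), axis=None)[int(weights.size * sparsity)]
    return np.where(np.abs(weights) >= threshold, weights, 0.0)

W = np.random.randn(784, 300)                # e.g. the first LeNet-300-100 layer
W_sparse = magnitude_prune(W, sparsity=0.9)  # 90% sparsity is an illustrative choice
```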

Coarse-Grained Pruning
Sparse Matrix Storage and Computational Optimization
Block Pruning Model
Experiments and Results
Block Sparse Matrix Computation and Cache Hit Ratio Experiment
Block-Based Pruning Strategy on Convolutional Network
Conclusions and Discussion