Abstract
Structured pruning has been proposed to compress neural network models. However, finding a pruning rate that suppresses the accuracy degradation of the pruned model is difficult because existing structured pruning methods assign the pruning rate manually. In this paper, we propose an automatic pruning rate derivation method for structured pruning that removes the workload of inefficient manual pruning rate assignment. The pruning error (the L1-norm of the pruned weights) depends on the pruning rate; our method therefore derives the pruning rate by comparing the pruning error against a threshold. When the pruning error is smaller than the threshold, the accuracy degradation of the pruned model is suppressed. We demonstrate the superiority of the proposed method over state-of-the-art methods on CIFAR-10 and ImageNet with various ResNets. For example, the proposed method removes 56.2% of the parameters of ResNet-50 on ImageNet while achieving 75.32% accuracy, similar to earlier works.
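The core idea in the abstract, comparing a pruning error against a threshold to pick the pruning rate, can be sketched in a few lines. The snippet below is a minimal illustration, not the paper's implementation: the function name `derive_pruning_rate`, the per-filter L1-norm importance score, and the normalization of the pruning error by the layer's total L1-norm are all assumptions made for this example.

```python
import numpy as np

def derive_pruning_rate(weight, threshold):
    """Illustrative sketch: choose the largest pruning rate whose pruning
    error (L1-norm of the pruned filters) stays below `threshold`.

    `weight` is assumed to be a 4-D conv weight tensor of shape
    (out_channels, in_channels, kH, kW); filters are pruned as whole units,
    as in structured pruning.
    """
    # L1-norm of each output filter (a common structured-pruning importance score).
    filter_norms = np.abs(weight).sum(axis=(1, 2, 3))
    order = np.argsort(filter_norms)      # least- to most-important filters
    total_norm = filter_norms.sum()

    rate = 0.0
    for k in range(1, len(order) + 1):
        # Pruning error if the k least-important filters are removed,
        # normalized by the layer's total L1-norm (an assumed normalization).
        pruning_error = filter_norms[order[:k]].sum() / total_norm
        if pruning_error >= threshold:
            break
        rate = k / len(order)
    return rate

# Usage with a random layer and an illustrative threshold.
w = np.random.randn(64, 32, 3, 3)
print(derive_pruning_rate(w, threshold=0.05))
```

Under this sketch, each layer can receive its own pruning rate automatically from a single global threshold, which is the workload reduction the abstract describes.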