QLP: Deep Q-Learning for Pruning Deep Neural Networks

Efe Camci,Jie Lin,Manas Gupta,Min Wu

doi:10.1109/tcsvt.2022.3167951

Abstract

We present a novel, deep Q-learning based method, QLP, for pruning deep neural networks (DNNs). Given a DNN, our method intelligently determines favorable layer-wise sparsity ratios, which are then implemented via unstructured, magnitude-based, weight pruning. In contrast to previous reinforcement learning (RL) based pruning methods, our method is not forced to prune a DNN within a single, sequential pass from the first layer to the last. It visits each layer multiple times and prunes them little by little at each visit, achieving superior granular pruning. Moreover, our method is not restricted to a subset of actions within the feasible action space. It has the flexibility to execute a whole range of sparsity ratios (0% - 100%) for each layer. This enables aggressive pruning without compromising accuracy. Furthermore, our method does not require a complex state definition; it features a simple, generic definition that is composed of only the index and the density of the layers, which leads to less computational demand while observing the state at each interaction. Lastly, our method utilizes a carefully designed curriculum that enables learning targeted policies for each sparsity regime, which helps to deliver better accuracy, especially at high sparsity levels. We conduct batched performance tests at compelling sparsity levels (up to 98%), present extensive ablation studies to justify our RL-related design choices, and compare our method with the state-of-the-art, including RL-based and other pruning methods. Our method sets the new state-of-the-art results in most of the experiments with ResNet-32 and ResNet-56 over CIFAR-10 dataset as well as ResNet-50 and MobileNet-v1 over ILSVRC2012 (ImageNet) dataset.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

QLP: Deep Q-Learning for Pruning Deep Neural Networks

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems for Video Technology

Lead the way for us

Journal: IEEE Transactions on Circuits and Systems for Video Technology	Publication Date: Oct 1, 2022
Citations: 13

Similar Papers

StructADMM: Achieving Ultrahigh Efficiency in Structured Pruning for DNNs
Tianyun Zhang ... Xue Lin
IEEE Transactions on Neural Networks and Learning Systems | VOL. 33
Tianyun Zhang, et. al.Tianyun Zhang ... Xue Lin
01 May 2022
IEEE Transactions on Neural Networks and Learning Systems | VOL. 33

Towards efficient deep neural network execution with model compression and platform-specific optimization
Xiaolong Ma
-
Xiaolong MaXiaolong Ma
10 Feb 2023
10 Feb 2023

Truth Table Net: Scalable, Compact & Verifiable Neural Networks with a Dual Convolutional Small Boolean Circuit Networks Form
Adrien Benamira ... Trevor Yap
-
Adrien Benamira, et. al.Adrien Benamira ... Trevor Yap
01 Aug 2024
01 Aug 2024

ADMM-NN
Ao Ren ... Tianyun Zhang
-
Ao Ren, et. al.Ao Ren ... Tianyun Zhang
04 Apr 2019
04 Apr 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

QLP: Deep Q-Learning for Pruning Deep Neural Networks

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems for Video Technology