Abstract

Transformer models have demonstrated promising potential and achieved excellent performance on a range of computer vision tasks. However, the huge computational cost of vision transformers hinders their deployment on edge devices. Recent works have proposed to identify and remove the unimportant units of vision transformers. Despite achieving remarkable results, these methods consider only the width dimension of the network and ignore network depth, another important dimension for pruning vision transformers. Therefore, we propose a Width & Depth Pruning (WDPruning) framework that reduces both dimensions simultaneously. Specifically, for width pruning, a set of learnable pruning-related parameters adaptively adjusts the width of the transformer. For depth pruning, we introduce several shallow classifiers that use the intermediate features of the transformer blocks, allowing images to be classified by a shallow classifier instead of the final, deeper one. At inference time, all blocks after a chosen shallow classifier can be dropped, so the shallow classifiers introduce no additional parameters or computation. Experimental results on benchmark datasets demonstrate that the proposed method significantly reduces the computational cost of mainstream vision transformers such as DeiT and Swin Transformer with only a minor accuracy drop. In particular, on ILSVRC-12, compressing DeiT-Base achieves a FLOPs pruning ratio of over 22%, together with an increase of 0.14% in Top-1 accuracy.
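To make the two mechanisms more concrete, below is a minimal, illustrative PyTorch sketch, not the authors' released implementation: a width gate that learns per-channel saliency scores and a learnable threshold to mask channels, and an early-exit head attached after an intermediate transformer block. The class names (`WidthGate`, `ShallowClassifier`), the straight-through gradient trick, and the use of the [CLS] token for classification are assumptions made for illustration only.

```python
import torch
import torch.nn as nn


class WidthGate(nn.Module):
    """Illustrative learnable gate over a channel dimension (assumption, not the paper's exact code).

    Each channel gets a learnable saliency score; channels whose score falls below a
    learnable threshold are masked out. The forward pass uses a hard binary mask,
    while gradients flow through a sigmoid proxy (straight-through estimator).
    """

    def __init__(self, dim: int):
        super().__init__()
        self.scores = nn.Parameter(torch.zeros(dim))       # per-channel saliency
        self.threshold = nn.Parameter(torch.tensor(0.0))   # learnable cut-off

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        hard = (self.scores > self.threshold).float()          # binary keep/prune mask
        soft = torch.sigmoid(self.scores - self.threshold)     # differentiable proxy
        mask = hard + soft - soft.detach()                     # straight-through trick
        return x * mask                                        # broadcast over (B, N, dim)


class ShallowClassifier(nn.Module):
    """Illustrative early-exit head placed after an intermediate transformer block."""

    def __init__(self, dim: int, num_classes: int):
        super().__init__()
        self.norm = nn.LayerNorm(dim)
        self.head = nn.Linear(dim, num_classes)

    def forward(self, tokens: torch.Tensor) -> torch.Tensor:
        cls_token = tokens[:, 0]                 # assume classification from the [CLS] token
        return self.head(self.norm(cls_token))
```

The straight-through mask keeps the forward pass binary while still letting gradients reach the saliency scores and the threshold during training; in such a setup, once an early-exit head is selected as the classifier, every block after it can simply be removed from the deployed model, which is why the exits add no inference-time parameters or FLOPs.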
