Dynamic Slimmable Network

Changlin Li,Bing Wang,Guangrun Wang,Zhihui Li,Xiaojun Chang,Xiaodan Liang

doi:10.1109/cvpr46437.2021.00850

Abstract

Current dynamic networks and dynamic pruning methods have shown their promising capability in reducing theoretical computation complexity. However, dynamic sparse patterns on convolutional filters fail to achieve actual acceleration in real-world implementation, due to the extra burden of indexing, weight-copying, or zero-masking. Here, we explore a dynamic network slimming regime, named Dynamic Slimmable Network (DS-Net), which aims to achieve good hardware-efficiency via dynamically adjusting filter numbers of networks at test time with respect to different inputs, while keeping filters stored statically and contiguously in hardware to prevent the extra burden. Our DS-Net is empowered with the ability of dynamic inference by the proposed double-headed dynamic gate that comprises an attention head and a slimming head to predictively adjust network width with negligible extra computation cost. To ensure generality of each candidate architecture and the fairness of gate, we propose a disentangled two-stage training scheme inspired by one-shot NAS. In the first stage, a novel training technique for weight-sharing networks named In-place Ensemble Bootstrapping is proposed to improve the supernet training efficacy. In the second stage, Sandwich Gate Sparsification is proposed to assist the gate training by identifying easy and hard samples in an online way. Extensive experiments demonstrate our DS-Net consistently outperforms its static counterparts as well as state-of-the-art static and dynamic model compression methods by a large margin (up to 5.9%). Typically, DS-Net achieves 2-4× computation reduction and 1.62× real-world acceleration over ResNet-50 and MobileNet with minimal accuracy drops on ImageNet. <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">1</sup>

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Dynamic Slimmable Network

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

DS-Net++: Dynamic Weight Slicing for Efficient Inference in CNNs and Vision Transformers.
Changlin Li ... Xiaodan Liang
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 45
Changlin Li, et. al.Changlin Li ... Xiaodan Liang
01 Apr 2023
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 45

Dynamic Slimmable Denoising Network.
Zutao Jiang ... Ling Chen
IEEE Transactions on Image Processing | VOL. PP
Zutao Jiang, et. al.Zutao Jiang ... Ling Chen
01 Jan 2023
IEEE Transactions on Image Processing | VOL. PP

Few-shot Link Prediction in Dynamic Networks
Cheng Yang ... Chuan Shi
-
Cheng Yang, et. al.Cheng Yang ... Chuan Shi
11 Feb 2022
11 Feb 2022

Cryotherapy with dynamic intermittent compression for analgesia after anterior cruciate ligament reconstruction. Preliminary study
J Murgier ... X Cassard
Orthopaedics & Traumatology: Surgery & Research | VOL. 100
J Murgier, et. al.J Murgier ... X Cassard
25 Mar 2014
Orthopaedics & Traumatology: Surgery & Research | VOL. 100

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Dynamic Slimmable Network

Abstract

Talk to us

Similar Papers