Abstract

Normalization methods deal with the training of the parameters of convolutional neural networks (CNNs), which often contain multiple convolution layers. Although the layers of a CNN are not homogeneous in the roles they play in representing the prediction function, existing works often employ an identical normalizer in every layer, leaving performance short of its potential. To tackle this problem and further boost performance, the recently proposed switchable normalization (SN) offers a new perspective for deep learning: it learns to select different normalizers for different convolution layers of a ConvNet. However, SN uses a softmax function to learn the importance ratios with which the normalizers are combined, which not only leads to redundant computation compared to a single normalizer but also makes the model less interpretable. This work addresses this issue by presenting sparse switchable normalization (SSN), in which the importance ratios are constrained to be sparse. Unlike $\ell_1$ and $\ell_0$ regularizations, whose layer-wise regularization coefficients are difficult to tune, we turn this sparsity-constrained optimization problem into a feed-forward computation by proposing SparsestMax, a sparse version of softmax. SSN has several appealing properties. (1) It inherits all the benefits of SN, such as applicability to various tasks and robustness to a wide range of batch sizes. (2) It is guaranteed to select only one normalizer for each normalization layer, avoiding redundant computation and improving the interpretability of normalizer selection. (3) SSN can be transferred to various tasks in an end-to-end manner. Extensive experiments show that SSN outperforms its counterparts on challenging benchmarks such as ImageNet, COCO, Cityscapes, ADE20K, Kinetics and MegaFace. Models and code are available at https://github.com/switchablenorms/Sparse_SwitchNorm.
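To make the mechanism concrete, the sketch below contrasts a dense softmax combination of normalizer importance ratios (as in SN) with a sparse simplex projection that produces exact zeros in a single feed-forward pass, with no layer-wise regularization coefficients to tune. It uses sparsemax (Martins & Astudillo, 2016) as a stand-in for the sparsity-inducing step; the paper's SparsestMax is a different construction that further guarantees the ratios converge to a one-hot vector, so this is only a minimal illustration of the general principle, not the authors' method. The logit values and variable names here are hypothetical.

```python
import numpy as np

def softmax(z):
    """Dense combination as used by SN: every normalizer gets a nonzero weight."""
    e = np.exp(z - z.max())
    return e / e.sum()

def sparsemax(z):
    """Euclidean projection of logits z onto the probability simplex
    (Martins & Astudillo, 2016). Unlike softmax, the output can contain
    exact zeros, so some normalizers receive zero importance."""
    z = np.asarray(z, dtype=np.float64)
    z_sorted = np.sort(z)[::-1]
    cumsum = np.cumsum(z_sorted)
    k = np.arange(1, z.size + 1)
    support = 1.0 + k * z_sorted > cumsum    # coordinates kept in the support
    k_max = k[support][-1]                   # size of the support
    tau = (cumsum[k_max - 1] - 1.0) / k_max  # soft threshold
    return np.maximum(z - tau, 0.0)

# Hypothetical importance logits for three normalizers: (BN, IN, LN).
logits = np.array([1.0, 0.5, -1.0])
print(softmax(logits))    # ~[0.57 0.35 0.08] -- all three normalizers must be computed
print(sparsemax(logits))  #  [0.75 0.25 0.  ] -- LN is pruned away entirely
```

Because the sparse projection still sums to one and is piecewise differentiable, it can be trained by back-propagation just like softmax; SparsestMax pushes this further so that exactly one ratio survives per layer, which is what removes the redundant normalizer computation mentioned above.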
