Distribution-Sensitive Information Retention for Accurate Binary Neural Network

Haotong Qin,Ruihao Gong,Xianglong Liu,Xiangguo Zhang,Yi Xu,Yifu Ding

doi:10.1007/s11263-022-01687-5

Abstract

Model binarization is an effective method of compressing neural networks and accelerating their inference process, which enables state-of-the-art models to run on resource-limited devices. Recently, advanced binarization methods have been greatly improved by minimizing the quantization error directly in the forward process. However, a significant performance gap still exists between the 1-bit model and the 32-bit one. The empirical study shows that binarization causes a great loss of information in the forward and backward propagation which harms the performance of binary neural networks (BNNs). We present a novel distribution-sensitive information retention network (DIR-Net) that retains the information in the forward and backward propagation by improving internal propagation and introducing external representations. The DIR-Net mainly relies on three technical contributions: (1) Information Maximized Binarization (IMB): minimizing the information loss and the binarization error of weights/activations simultaneously by weight balance and standardization; (2) Distribution-sensitive Two-stage Estimator (DTE): retaining the information of gradients by distribution-sensitive soft approximation by jointly considering the updating capability and accurate gradient; (3) Representation-align Binarization-aware Distillation (RBD): retaining the representation information by distilling the representations between full-precision and binarized networks. The DIR-Net investigates both forward and backward processes of BNNs from the unified information perspective, thereby providing new insight into the mechanism of network binarization. The three techniques in our DIR-Net are versatile and effective and can be applied in various structures to improve BNNs. Comprehensive experiments on the image classification and objective detection tasks show that our DIR-Net consistently outperforms the state-of-the-art binarization approaches under mainstream and compact architectures, such as ResNet, VGG, EfficientNet, DARTS, and MobileNet. Additionally, we conduct our DIR-Net on real-world resource-limited devices which achieves \(11.1\times \) storage saving and \(5.4\times \) speedup.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Distribution-Sensitive Information Retention for Accurate Binary Neural Network

Abstract

Talk to us

Similar Papers

More From: International Journal of Computer Vision

Lead the way for us

Journal: International Journal of Computer Vision	Publication Date: Oct 2, 2022
Citations: 25

Similar Papers

Forward and Backward Information Retention for Accurate Binary Neural Networks
Haotong Qin ... Xianglong Liu
-
Haotong Qin, et. al.Haotong Qin ... Xianglong Liu
01 Jun 2020
01 Jun 2020

Binary neural networks for image super-resolution
馨蕊姜 ... 新波高
SCIENTIA SINICA Informationis | VOL. 51
馨蕊姜, et. al.馨蕊姜 ... 新波高
01 Oct 2021
SCIENTIA SINICA Informationis | VOL. 51

Regularizing Binary Neural Networks via Ensembling for Efficient Person Re-Identification
Ayse Serbetci ... Yusuf Sinan Akgul
IEEE Access | VOL. 11
Ayse Serbetci, et. al.Ayse Serbetci ... Yusuf Sinan Akgul
01 Jan 2023
IEEE Access | VOL. 11

Balanced Binary Neural Networks with Gated Residual
Mingzhu Shen ... Xianglong Liu
-
Mingzhu Shen, et. al.Mingzhu Shen ... Xianglong Liu
02 Apr 2020
02 Apr 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Distribution-Sensitive Information Retention for Accurate Binary Neural Network

Abstract

Talk to us

Similar Papers

More From: International Journal of Computer Vision