Abstract

Among various network compression methods, network quantization has developed rapidly due to its superior compression performance. However, naive activation quantization schemes limit the compression performance of network quantization. Most conventional activation quantization methods directly quantize the outputs of rectified activation functions, yet these unbounded outputs generally cause drastic accuracy degradation. To tackle this problem, we propose a comprehensive activation quantization technique, the Bilateral Clipping Parametric Rectified Linear Unit (BCPReLU), as a generalized version of all rectified activation functions that limits the quantization range more flexibly during training. Specifically, trainable slopes and thresholds are introduced for both positive and negative inputs to find more flexible quantization scales. We theoretically demonstrate that BCPReLU has approximately the same expressive power as the corresponding unbounded version and establish its convergence in low-bit quantization networks. Extensive experiments on a variety of datasets and network architectures demonstrate the effectiveness of our trainable clipping activation function.
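To make the idea concrete, here is a minimal sketch of a bilaterally clipped parametric rectified unit of the kind the abstract describes: positive and negative inputs each get their own slope and clipping threshold. The parameter names (`alpha_p`, `alpha_n`, `t_p`, `t_n`) and default values are our own illustrative assumptions, not the paper's; in the actual method all four would be trainable.

```python
import numpy as np

def bcprelu(x, alpha_p=1.0, alpha_n=0.25, t_p=6.0, t_n=-6.0):
    """Illustrative bilaterally clipped parametric ReLU (not the paper's code).

    Positive inputs are scaled by alpha_p and clipped above at t_p;
    negative inputs are scaled by alpha_n and clipped below at t_n,
    so the output range [t_n, t_p] is bounded and easy to quantize.
    """
    x = np.asarray(x, dtype=float)
    pos = np.minimum(alpha_p * np.maximum(x, 0.0), t_p)  # clipped positive branch
    neg = np.maximum(alpha_n * np.minimum(x, 0.0), t_n)  # clipped negative branch
    return pos + neg
```

Because both branches are clipped, the quantization scale can be derived directly from the learned thresholds instead of from an unbounded activation range.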
