Class Difficulty based Mixed Precision Quantization for Low Complexity CNN Training

Joongho Jo,Jongsun Park

doi:10.1109/isocc56007.2022.10031597

Abstract

Low-bit quantization of CNN training is highly needed for reducing the computational complexity of convolutional neural network (CNN) training. In CNN training, some of the classes can finish training early (reaches high accuracy in early training epochs) while other classes need more time (epochs) to finish training. This measure of training difficulty can be efficiently exploited for the mixed precision quantization to reduce the computational complexity of CNN training. In this paper, we present a training difficulty based mixed precision training approach, where easy-to-train classes are trained using low-bit quantization and the hard-to-train classes are trained using high bit quantization. The simulation results show that the proposed mixed precision training can achieve 1.33X improved compression ratio with the same accuracy compared to 8-bit (activations and weights) and 16-bit (gradients of activation and weight) uniform quantization training for ResNet-20 using the CIFAR-10 dataset.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Class Difficulty based Mixed Precision Quantization for Low Complexity CNN Training

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Acceleration of Deep Neural Network Training Using Field Programmable Gate Arrays.
Guta Tesema Tufa ... Anchit Bijalwan
Computational Intelligence and Neuroscience | VOL. 2022
Guta Tesema Tufa, et. al.Guta Tesema Tufa ... Anchit Bijalwan
17 Oct 2022
Computational Intelligence and Neuroscience | VOL. 2022

Real-Time CNN Training and Compression for Neural-Enhanced Adaptive Live Streaming.
Seunghwa Jeong ... Junyong Noh
IEEE transactions on pattern analysis and machine intelligence | VOL. 46
Seunghwa Jeong, et. al.Seunghwa Jeong ... Junyong Noh
01 Sep 2024
IEEE transactions on pattern analysis and machine intelligence | VOL. 46

Comparative characteristics of the ability of convolutional neural networks to the concept of transfer learning
Vladimir Khotsyanovsky
Technology audit and production reserves | VOL. 1
Vladimir KhotsyanovskyVladimir Khotsyanovsky
11 Feb 2022
Technology audit and production reserves | VOL. 1

FPGA-Based Accelerator for Deep Convolutional Neural Networks for the SPARK Environment
Raghid Morcel ... Mazen Ezzeddine
-
Raghid Morcel, et. al.Raghid Morcel ... Mazen Ezzeddine
01 Nov 2016
01 Nov 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Class Difficulty based Mixed Precision Quantization for Low Complexity CNN Training

Abstract

Talk to us

Similar Papers