Abstract

The high performance of state-of-the-art deep neural networks (DNNs) comes at the cost of substantial computing resources. Network quantization has recently been recognized as a promising way to reduce this resource usage significantly. However, previous quantization work has focused mostly on DNN inference, and very few efforts have addressed the challenges of DNN training. In this paper, we leverage a dynamic fixed-point (DFP) quantization algorithm and a stochastic rounding (SR) strategy to develop fully quantized 8-bit neural networks targeting low-bitwidth training. Experiments show that, compared with full-precision networks, the accuracy drop of our quantized convolutional neural networks (CNNs) is less than 2%, even for deep models evaluated on the ImageNet dataset. Additionally, our 8-bit GNMT translation network achieves a BLEU score nearly identical to that of the full-precision network. We further implement a prototype on an FPGA, and the synthesis results show that the low-bitwidth training scheme can reduce resource usage significantly.
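
To make the two named ingredients concrete, below is a minimal sketch of 8-bit dynamic fixed-point quantization with stochastic rounding. It is an illustrative example only, not the authors' implementation: the function name, the per-tensor choice of shared exponent from the maximum magnitude, and the NumPy-based formulation are assumptions made for clarity.

```python
import numpy as np

def quantize_dfp_sr(x, bits=8, rng=None):
    """Quantize a tensor to dynamic fixed-point with stochastic rounding.

    Illustrative sketch (not the paper's exact scheme): a shared exponent is
    chosen per tensor from its maximum magnitude, and each value is rounded
    down or up with probability given by its fractional part, so the rounding
    is unbiased in expectation.
    """
    rng = np.random.default_rng() if rng is None else rng
    x = np.asarray(x, dtype=np.float64)

    # Dynamic fixed-point: pick a per-tensor shared exponent so that the
    # largest magnitude fits in the signed (bits-1)-bit integer range.
    max_abs = np.max(np.abs(x))
    if max_abs == 0.0:
        return np.zeros_like(x), 0
    exp = int(np.ceil(np.log2(max_abs / (2 ** (bits - 1) - 1))))
    scale = 2.0 ** exp

    # Stochastic rounding: floor, then add 1 with probability equal to the
    # fractional remainder.
    y = x / scale
    floor_y = np.floor(y)
    frac = y - floor_y
    q = floor_y + (rng.random(x.shape) < frac)
    q = np.clip(q, -(2 ** (bits - 1)), 2 ** (bits - 1) - 1)

    # Return the dequantized values and the shared exponent.
    return q * scale, exp
```

In a low-bitwidth training loop, a routine of this kind would typically be applied to weights, activations, and gradients before the corresponding compute, with stochastic rounding preserving small gradient updates that deterministic rounding would discard.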
