Abstract
Low-precision training has emerged as a practical approach to reducing the time, memory, and energy costs of training deep neural networks (DNNs). Typically, lowering precision introduces quantization errors that must be minimized to maintain model performance, while the potential benefits of reduced training precision are overlooked. This paper rethinks low-precision training and highlights two such benefits: (1) low precision can act as a form of regularization in DNN training by constraining excessive variance in the model; (2) layer-wise low precision can be viewed as an alternative dimension of sparsity, orthogonal to pruning, that contributes to improved generalization in DNNs. Based on these analyses, we propose a simple yet powerful technique, DPC (Decreasing Precision with layer Capacity), which directly assigns different bit-widths to model layers without requiring an exhaustive analysis of the training process or any delicate low-precision criteria. Extensive experiments on five datasets and fourteen models across various applications consistently demonstrate that DPC reduces computational cost by 16.21%–44.37% while achieving comparable or even superior accuracy (up to +0.68%, +0.21% on average). Furthermore, we provide feature embedding visualizations and additional experiments to investigate the underlying mechanisms behind DPC's effectiveness, deepening our understanding of low-precision training. Our source code will be released upon paper acceptance.
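The abstract states only that DPC assigns per-layer bit-widths according to layer capacity; the exact assignment rule is not given here. The sketch below is a minimal, assumption-based illustration in PyTorch: it measures each weighted layer's capacity by parameter count and linearly maps larger layers to lower bit-widths within an assumed [bit_min, bit_max] range. The function name `assign_bitwidths` and the bounds are hypothetical, not the paper's implementation.

```python
import torch.nn as nn


def assign_bitwidths(model: nn.Module, bit_max: int = 8, bit_min: int = 4) -> dict:
    """Illustrative sketch: map each weighted layer to a bit-width that
    decreases with its capacity (parameter count). The linear mapping and
    the [bit_min, bit_max] bounds are assumptions, not the paper's rule."""
    layers = [(name, m) for name, m in model.named_modules()
              if isinstance(m, (nn.Conv2d, nn.Linear))]
    capacities = [sum(p.numel() for p in m.parameters()) for _, m in layers]
    c_min, c_max = min(capacities), max(capacities)

    bitwidths = {}
    for (name, _), c in zip(layers, capacities):
        # Normalize capacity to [0, 1]; larger capacity -> lower precision.
        t = 0.0 if c_max == c_min else (c - c_min) / (c_max - c_min)
        bitwidths[name] = round(bit_max - t * (bit_max - bit_min))
    return bitwidths


if __name__ == "__main__":
    # Example: inspect the assignment on a standard torchvision ResNet-18.
    from torchvision.models import resnet18

    bits = assign_bitwidths(resnet18())
    for name, b in list(bits.items())[:5]:
        print(f"{name}: {b}-bit")
```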