Training Deep Neural Networks Using Posit Number System

Jinming Lu,Zhongfeng Wang,Jun Lin,Li Du,Chao Fang,Zhisheng Wang,Siyuan Lu

doi:10.1109/socc46988.2019.1570558530

Abstract

With the increasing size of Deep Neural Network (DNN) models, the high memory space requirements and computational complexity have become an obstacle for efficient DNN implementations. To ease this problem, using reduced-precision representations for DNN training and inference has attracted many interests from researchers. This paper first proposes a methodology for training DNNs with the posit arithmetic, a type-3 universal number (Unum) format that is similar to the floating point(FP) but has reduced precision. A warm-up training strategy and layer-wise scaling factors are adopted to stabilize training and fit the dynamic range of DNN parameters. With the proposed training methodology, we demonstrate the first successful training of DNN models on ImageNet image classification task in 16 bits posit with no accuracy loss. Then, an efficient hardware architecture for the posit multiply-and-accumulate operation is also proposed, which can achieve significant improvement in energy efficiency than traditional floating-point implementations. The proposed design is helpful for future low-power DNN training accelerators.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Training Deep Neural Networks Using Posit Number System

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

A Framework for Distributed Deep Neural Network Training with Heterogeneous Computing Platforms
Bontak Gu ... Arslan Munir
-
Bontak Gu, et. al.Bontak Gu ... Arslan Munir
01 Dec 2019
01 Dec 2019

STADIA: Photonic Stochastic Gradient Descent for Neural Network Accelerators
Chengpeng Xia ... Haibo Zhang
ACM Transactions on Embedded Computing Systems | VOL. 22
Chengpeng Xia, et. al.Chengpeng Xia ... Haibo Zhang
09 Sep 2023
ACM Transactions on Embedded Computing Systems | VOL. 22

A comprehensive exploration of approximate DNN models with a novel floating-point simulation framework
Myeongjin Kwak ... Yongtae Kim
Performance Evaluation | VOL. 165
Myeongjin Kwak, et. al.Myeongjin Kwak ... Yongtae Kim
25 May 2024
Performance Evaluation | VOL. 165

Structured representation in deep neural network systems
Caiwen Ding
-
Caiwen DingCaiwen Ding
10 May 2021
10 May 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Training Deep Neural Networks Using Posit Number System

Abstract

Talk to us

Similar Papers