Abstract

With the increasing real-time constraints imposed on the use of Deep Neural Networks (DNNs), there is a need to rethink how information is represented. A promising though challenging path is to employ an encoding that allows fast processing and a hardware-friendly representation of information. Among the proposed alternatives to the IEEE 754 standard for the floating point representation of real numbers, the recently introduced Posit format has been shown, theoretically, to be very promising in satisfying these requirements. However, in the absence of proper hardware support for this novel type, such an evaluation can only be conducted through software emulation. While waiting for the widespread availability of Posit Processing Units (the equivalent of the Floating Point Unit (FPU)), we can already exploit the Posit representation and the currently available Arithmetic-Logic Unit (ALU) to speed up DNNs by manipulating the low-level bit string representations of Posits. As a first step, in this paper we present new arithmetic properties of the Posit number system, with a focus on the configuration with 0 exponent bits. In particular, we propose a new class of Posit operators, called L1 operators, consisting of fast, approximated versions of existing arithmetic operations or functions (e.g., the hyperbolic tangent (TANH) and the extended linear unit (ELU)) that use only integer arithmetic. These operators offer very interesting properties and results: (i) faster evaluation than their exact counterparts, with negligible accuracy degradation; (ii) an efficient ALU emulation of a number of Posit operations; and (iii) the possibility to vectorize Posit operations using existing vectorized ALU instructions (such as the Scalable Vector Extension of ARM CPUs or the Advanced Vector Extensions of Intel CPUs). As a second step, we test the proposed activation function on Posit-based DNNs, showing that 16-bit down to 10-bit Posits are an exact replacement for 32-bit floats, while 8-bit Posits can be an interesting alternative to 32-bit floats: their performance is slightly lower, but their high speed and low storage requirements are very appealing (leading to a lower bandwidth demand and more cache-friendly code). Finally, we point out that small Posits (i.e., up to 14 bits long) remain very interesting until PPUs become widespread, since Posit operations can be tabulated in a very efficient way (see details in the text).
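To make the L1 idea concrete, the sketch below shows the well-known fast sigmoid trick for a posit⟨16,0⟩ (the es = 0 configuration the paper focuses on): the function is evaluated by flipping the sign bit of the raw bit string and shifting it right by two, using integer ALU operations only. This is a minimal sketch; the function name and the use of uint16_t as the raw Posit container are our assumptions, not the cppPosit API.

```cpp
#include <cstdint>

// FastSigmoid sketch for a posit<16,0>: x_bits is the raw 16-bit
// Posit string (two's-complement encoding). Flipping the sign bit
// and shifting right by two closely approximates 1/(1 + e^-x),
// with no decode/encode step -- just two integer ALU operations.
inline uint16_t fast_sigmoid_p16(uint16_t x_bits) {
    return static_cast<uint16_t>((x_bits ^ 0x8000u) >> 2);
}
```

Since the body is a single XOR and shift on integers, it vectorizes directly over arrays of bit strings with existing integer SIMD instructions (e.g., NEON/SVE or AVX), which is the property referred to in point (iii) above.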

Highlights

  • Due to the pervasiveness of real-time and critical systems such as Internet of Things (IoT) platforms, automotive systems, and robotics, new types of requirements are being imposed on the use of Deep Neural Networks (DNNs). The main challenges when dealing with DNNs are both the ubiquitous multiply-and-accumulate operations and the massive use of activation functions across the neural network layers

  • We propose a new class of Posit operators, called L1 operators, consisting of fast, approximated versions of existing arithmetic operations or functions (e.g., the hyperbolic tangent (TANH) and the extended linear unit (ELU), both defined in the formulas after this list) that use only integer arithmetic

  • We test the proposed activation functions on Posit-based DNNs, showing that 16-bit down to 10-bit Posits are an exact replacement for 32-bit floats, while 8-bit Posits can be an interesting alternative to 32-bit floats: their performance is slightly lower, but their high speed and low storage requirements are very appealing
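For reference, the two activation functions targeted by the L1 approximations are defined as follows (with α the usual ELU shape parameter, typically 1, and σ the sigmoid function):

\[
\tanh(x) = \frac{e^{x}-e^{-x}}{e^{x}+e^{-x}} = 2\,\sigma(2x) - 1,
\qquad
\mathrm{ELU}(x) =
\begin{cases}
x, & x \ge 0,\\
\alpha\,(e^{x}-1), & x < 0.
\end{cases}
\]

The identity on the left is what makes a fast TANH derivable from a fast sigmoid approximation.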


Summary

Introduction

Due to the pervasiveness of real-time and critical systems such as Internet of Things (IoT) platforms, automotive systems, and robotics, new types of requirements are being imposed on the use of Deep Neural Networks (DNNs). Even the use of floating point SIMD engines is not always possible in embedded systems (e.g., the ARM Cortex-M4 [3]). This means that we cannot always rely on high-performance processing units in critical and real-time scenarios, and we need to address new challenges. Being able to write functions as a sequence of arithmetic-logic operations allows us to vectorize them by exploiting already existing SIMD (Single Instruction–Multiple Data) engines. In this extension, we propose a new fast, approximated version of the Extended Linear Unit (ELU) activation function. We also investigate operator tabulation as a different approach to speed up Posit emulation, without constraints on the exponent configuration; this allows us to accelerate basic arithmetic operators, such as sum and multiplication, that are not suitable for implementation as L1 functions (a minimal tabulation sketch follows this paragraph).
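To make the tabulation idea concrete, here is a minimal sketch (under assumed names) of a table-based posit⟨8,0⟩ multiplication: with 8-bit operands, the whole operator fits in a 256 × 256 = 64 KiB table indexed by the two raw bit strings, so each multiplication reduces to a single memory load. posit8_mul_exact stands in for any exact, software-emulated multiplication (e.g., the one cppPosit provides); its name and signature are our assumptions.

```cpp
#include <array>
#include <cstdint>

// Stand-in (assumed name) for the library's exact software-emulated
// posit<8,0> multiplication; used only when the table is built.
uint8_t posit8_mul_exact(uint8_t a, uint8_t b);

// 256 x 256 entries of one byte each = 64 KiB: cache-resident.
std::array<uint8_t, 256 * 256> mul_table;

// Fill the table once, offline or at startup.
void build_mul_table() {
    for (int a = 0; a < 256; ++a)
        for (int b = 0; b < 256; ++b)
            mul_table[(a << 8) | b] =
                posit8_mul_exact(static_cast<uint8_t>(a),
                                 static_cast<uint8_t>(b));
}

// At run time, multiplication is a single lookup on the bit strings.
inline uint8_t posit8_mul(uint8_t a_bits, uint8_t b_bits) {
    return mul_table[(a_bits << 8) | b_bits];
}
```

Exploiting the operator's symmetry (a·b = b·a) can roughly halve the table; as the abstract notes, this approach remains efficient for Posits up to about 14 bits (see details in the text).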

  • Posit Arithmetic
  • FastSigmoid
  • CppPosit Library
  • Tabulated Posits
  • Type Proxying
  • Brain Posits
  • Implementation Results
  • Conclusions and Future Work