Floating-point Arithmetic Operations Research Articles

The rapid advancement of AI requires efficient accelerators for training on edge devices, which are often constrained by the high hardware cost of floating-point arithmetic operations. To address this, efficient floating-point formats inspired by block floating-point (BFP), such as Microsoft Floating Point (MSFP) and FlexBlock (FB), are emerging. However, because all values in a block share one exponent, these formats offer limited dynamic range and precision for the smaller-magnitude values within the block. This limits BFP's ability to train deep neural networks (DNNs) on diverse datasets. This paper introduces hybrid precision floating-point (HPFP) selection algorithms, which systematically reduce precision and apply hybrid precision strategies, balancing layer-wise arithmetic and data path precision to address the shortcomings of traditional floating-point formats. Reducing the data bit width with HPFP allows more data to be read from or written to memory per cycle, thereby decreasing off-chip data access and the size of on-chip memories. Unlike traditional reduced-precision formats, which use BFP to calculate partial sums and accumulate them in 32-bit floating point (FP32), HPFP achieves significant hardware savings by performing all multiply and accumulate operations in a reduced floating-point format. For evaluation, two training accelerators for the YOLOv2-Tiny model were developed, each employing a distinct mixed-precision strategy, and their performance was benchmarked against an accelerator using the conventional 16-bit brain floating-point format (Bfloat16). The HPFP selection that uses 10 bits for the data path of all layers and for the arithmetic of layers requiring low precision, along with 12 bits for layers requiring higher precision, yields a 49.4% reduction in energy consumption and a 37.5% decrease in memory access. This is achieved with only a marginal mean Average Precision (mAP) degradation of 0.8% compared to an accelerator based on Bfloat16. This comparison demonstrates that the proposed HPFP-based accelerator can be an efficient approach to designing compact, low-power accelerators without sacrificing accuracy.
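The shared-exponent limitation described above can be illustrated with a small sketch. This is a generic block floating-point quantizer written for illustration only; it is not the MSFP, FlexBlock, or HPFP format from the paper, and the function name `bfp_quantize`, the rounding rule, and the 4-bit mantissa width are all assumptions.

```python
import math


def bfp_quantize(block, mantissa_bits):
    """Quantize a block of floats using one shared exponent (illustrative only).

    mantissa_bits counts the sign bit, so magnitudes use mantissa_bits - 1 bits.
    """
    max_abs = max(abs(x) for x in block)
    if max_abs == 0.0:
        return [0.0] * len(block)

    # Shared exponent taken from the largest magnitude in the block:
    # max_abs == m * 2**shared_exp with 0.5 <= m < 1.
    shared_exp = math.frexp(max_abs)[1]

    # Spacing of the representable grid; every value in the block uses it.
    step = 2.0 ** (shared_exp - (mantissa_bits - 1))
    limit = 2 ** (mantissa_bits - 1) - 1  # largest mantissa magnitude

    quantized = []
    for x in block:
        q = round(x / step)               # nearest grid point
        q = max(-limit, min(limit, q))    # saturate to the mantissa range
        quantized.append(q * step)
    return quantized


# Hypothetical block with one large and two small values.
block = [6.0, 0.75, 0.01]
quantized = bfp_quantize(block, mantissa_bits=4)
print(quantized)
```

In this example the grid spacing is set by the largest value (6.0), so 0.75 is rounded to 1.0 and 0.01 is flushed to zero, which is the kind of precision loss for small-magnitude values that the abstract attributes to the shared exponent.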

Purpose. To increase the sensitivity of a method for diagnosing phase synchronization of autogenerators from their non-stationary time series in real time, and to compare the statistical properties of the proposed modification with those of a well-known method for diagnosing loop synchronization that has proven itself in the analysis of experimental data. Methods. The paper compares the probability of an error of the second kind (type II error) for the developed modified method for diagnosing phase synchronization with that of the known method at equal values of sensitivity. The comparison uses generated test time series with a priori known boundaries of the phase synchronization sections, which reproduce the statistical properties of the experimental data. The computational complexity of the two methods is also compared. Results. A modification of the method for diagnosing phase synchronization of autonomic regulation circuits in real time is proposed. It is shown that the proposed modification provides values of sensitivity and of the probability of errors of the second kind similar to those of the previously proposed approach. The developed method has lower computational complexity than the previously proposed method. The values of the free parameters corresponding to different values of sensitivity and of the probability of errors of the second kind are obtained. Conclusion. The area of application of the modified method is formulated. The low computational complexity of the proposed method, together with the possibility of switching to integer computations, makes it suitable for wearable recording devices that perform calculations in real time on small, low-power processors that do not support floating-point arithmetic operations.

Articles published on Floating-point Arithmetic Operations

Hybrid Precision Floating-Point (HPFP) Selection to Optimize Hardware-Constrained Accelerator for CNN Training.

Multi-image super-resolution based low complexity deep network for image compressive sensing reconstruction

Optimization of re-configurable multi-core processors and security based on field programmable gate arrays

Max-C and Min-D Projection Auto-Associative Fuzzy Morphological Memories: Theory and an Application for Face Recognition

An efficient design methodology to speed up the FPGA implementation of artificial neural networks

High-speed binary coded decimal digit multipliers with multiple error detection

An exa-scale high-performance molecular dynamics simulation program: MODYLAS.

Implementation of Pipelined Multi_Precision (1, 2 and 4) Floating-point Arithmetic Operations

Vibration compensation of delta 3D printer with position-varying dynamics using filtered B-splines

Enabling In-Network Floating-Point Arithmetic for Efficient Computation Offloading

Increasing the sensitivity of real-time method for diagnostic of autogenerators phase synchronization based on their non-stationary time series

Parallel GPF solution: A GPU‐CPU‐based vectorization parallelization and sparse technique for NR implementation

Rigorous Lower Bounds for the Ground State Energy of Molecules by Employing Necessary N-Representability Conditions.

A multi-GPU implementation of a full-field crystal plasticity solver for efficient modeling of high-resolution microstructures

STUDY AND ANALYSIS OF REMOTE SENSING DATA PARALLEL PROCESSING

Resource Efficient Single Precision Floating Point Multiplier Using Karatsuba Algorithm

Data-Driven Background Subtraction Algorithm for In-Camera Acceleration in Thermal Imagery

Analysis of the precision problem in numerical calculation of the reliability indices of technical systems by the topological method at application of the floating-point arithmetic

Optimized Fundamental Signal Processing Operations For Energy Minimization on Heterogeneous Mobile Devices
