The rapid advancement of AI calls for efficient accelerators that can train deep neural networks on edge devices, where the high hardware cost of floating-point arithmetic is a major obstacle. To address this, efficient floating-point formats inspired by block floating point (BFP), such as Microsoft Floating Point (MSFP) and FlexBlock (FB), are emerging. However, because all values in a block share a single exponent, these formats offer limited dynamic range and poor precision for the smaller-magnitude values within a block, which restricts BFP's ability to train deep neural networks (DNNs) on diverse datasets. This paper introduces hybrid-precision floating-point (HPFP) selection algorithms that systematically reduce precision and apply hybrid-precision strategies, balancing layer-wise arithmetic precision against data-path precision to overcome the shortcomings of traditional floating-point formats. Reducing the data bit width with HPFP allows more read/write operations from memory per cycle, decreasing off-chip data access and shrinking on-chip memories. Unlike conventional reduced-precision schemes that compute partial sums in BFP and accumulate them in 32-bit floating point (FP32), HPFP performs all multiply and accumulate operations in a reduced floating-point format, yielding significant hardware savings. For evaluation, two training accelerators for the YOLOv2-Tiny model were developed with distinct mixed-precision strategies and benchmarked against an accelerator using conventional 16-bit brain floating point (Bfloat16). The HPFP selection that uses 10 bits for the data path of all layers and for the arithmetic of layers tolerating low precision, and 12 bits for layers requiring higher precision, reduces energy consumption by 49.4% and memory access by 37.5%, with only a marginal mean Average Precision (mAP) degradation of 0.8% relative to the Bfloat16-based accelerator. These results demonstrate that the proposed HPFP-based accelerator is an efficient approach to designing compact, low-power training accelerators without sacrificing accuracy.
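As an illustrative sketch only (the abstract does not specify the exact HPFP encoding), the snippet below emulates in software what rounding values to a reduced floating-point format looks like: each value is quantized to a chosen number of exponent and mantissa bits, which is one way to model a hypothetical 10-bit or 12-bit format, or Bfloat16, when studying layer-wise precision requirements. The function name `quantize_float` and the particular sign/exponent/mantissa splits shown are assumptions for illustration, not the paper's definitions.

```python
import numpy as np

def quantize_float(x, exp_bits, man_bits):
    """Round x to the nearest value representable with a sign bit,
    `exp_bits` exponent bits, and `man_bits` explicit mantissa bits
    (IEEE-like bias; overflow saturates to the largest finite value)."""
    x = np.asarray(x, dtype=np.float64)
    bias = 2 ** (exp_bits - 1) - 1
    min_normal = 2.0 ** (1 - bias)                     # smallest normal magnitude
    max_val = (2.0 - 2.0 ** -man_bits) * 2.0 ** bias   # largest finite magnitude

    sign = np.sign(x)
    mag = np.abs(x)
    # Binade of each value; clamping at the smallest normal exponent makes
    # tiny inputs round onto the subnormal grid of the target format.
    exp = np.floor(np.log2(np.maximum(mag, min_normal)))
    step = 2.0 ** (exp - man_bits)                     # spacing of representable values
    q = np.round(mag / step) * step                    # round mantissa to man_bits bits
    return sign * np.clip(q, 0.0, max_val)

# Example: the same tensor under a Bfloat16-like split and two assumed narrow splits.
vals = np.array([0.15625, -3.7, 1.0e-4, 42.0])
print(quantize_float(vals, exp_bits=8, man_bits=7))   # Bfloat16-like (1+8+7 bits)
print(quantize_float(vals, exp_bits=5, man_bits=4))   # assumed 10-bit split (1+5+4)
print(quantize_float(vals, exp_bits=5, man_bits=6))   # assumed 12-bit split (1+5+6)
```

With such an emulation in place, a precision-selection pass could sweep candidate bit widths per layer and keep the narrowest format whose simulated accuracy loss stays within a target budget, mirroring the layer-wise arithmetic/data-path trade-off the abstract describes.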