Arithmetic Blocks Research Articles

Nonvolatile memory (NVM)-based computing in-memory (CIM) is a promising solution to data-intensive applications. This work proposes a 2T2R resistive random access memory (ReRAM) architecture that supports three types of CIM operations: 1) ternary content addressable memory (TCAM); 2) logic in-memory (LiM) primitives and arithmetic blocks such as full adder (FA) and full subtractor; and 3) in-memory dot-product for neural networks. The proposed architecture allows the NVM operations in both 2T2R and conventional 1T1R configurations. The proposed LiM full adder (LiM-FA) improves the delay, the static power, and the dynamic power by <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$3.2\times $ </tex-math></inline-formula> , <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$1.2\times $ </tex-math></inline-formula> , and <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$1.6\times $ </tex-math></inline-formula> , respectively, compared with state-of-the-art LiM-FAs. Furthermore, based on different optimization techniques and robustness analysis, a lower precharge voltage is set for each mode. This reduces the TCAM search energy and 1T1R ReRAM access energy by <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$1.6\times $ </tex-math></inline-formula> and <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$1.14\times $ </tex-math></inline-formula> , respectively, compared with the case without optimizations.

High Definition (HD) image processing and real-time analytics over live video feeds have always been the key requirements for Intelligence, Surveillance and Reconnaissance (ISR) applications. With the evolution of optics and image enhancement techniques, computational loads of HD ISR systems are also rising exponentially. On the contrary, the slow-down of Moore's Law has recently posed challenging bounds over the level of achievable miniaturization for emerging processing and storage units. Field Programmable Gate Arrays (FPGAs) offer a popular choice of implementing ISR algorithms over resource-constrained platforms, such as Unmanned Aerial Vehicles (UAVs), due to favorable features of reconfigurability and rapid prototyping. A promising solution to bridge the gap between resource-constrained host platforms and computation-intensive FPGA applications is the paradigm of Approximate Computing. It compromises on the accuracy of processed results to offer significant performance gains for error-tolerant applications, such as video and image processing. In this paper, we present a novel approximate adder design methodology, for FPGA-based systems with improved SWaP performance, besides preserving the accuracy requirements within acceptable thresholds. The design methodology proposed in this paper focuses on the FPGA-specific Look-Up Table (LUT) architecture to introduce approximations while splitting the carry chain into LUT-based sub-adders, with flexible overlap to tune the adder's accuracy and achieve the overall latency of a single LUT. The paper presents several variants of the proposed design and offers application-oriented flexibility to adjust for optimal SWaP vs accuracy trade-off. We have further devised a comprehensive assessment approach to verify functional viability of the proposed atomic arithmetic blocks at system level, through their implementation into dense computational imaging applications, such as 2-dimensional Discrete Cosine Transform (DCT), airborne self-localization and moving object tracking algorithms, in comparison with other state-of-the-art adders. Our most accurate design performs at least 9.9% better in power consumption when compared with existing approximate adders, which proves that the proposed methodology holds promising potential to improve SWaP-index for computation-intensive UAV applications.

Arithmetic Blocks Research Articles

Related Topics

Articles published on Arithmetic Blocks

A Hardware Implementation of the PID Algorithm Using Floating-Point Arithmetic

Efficient realization of quantum balanced ternary reversible multiplier building blocks: A great step towards sustainable computing

Efficient Arithmetic Block Identification With Graph Learning and Network-Flow

Support vector machines implementation over integers modulo-M and Residue Number System

Implementation of Hardware and Energy Efficient Approximate Multiplier Architectures Using 4-2 Compressor for Images

Energy-Efficient Hardware Implementation of Fully Connected Artificial Neural Networks Using Approximate Arithmetic Blocks.

A High-Speed Low-Energy One-Trit Ternary Multiplier Circuit Design in CNTFET Technology

Process Variability Analysis in Interconnect, Logic, and Arithmetic Blocks of 16-nm FinFET FPGAs

A scalable high‐speed hybrid 1‐bit full adder design using XOR‐XNOR module

A Low Area FPGA Implementation of Reversible Gate Encryption with Heterogeneous Key Generation

Design and Analysis of Hybrid full adder Topology using Regular and Triplet Logic Design

Reconfigurable 2T2R ReRAM Architecture for Versatile Data Storage and Computing In-Memory

Cognitive and balance impairments in people with incidental white matter hyperintensities

Design of Ultra-Low Power Consumption Approximate 4–2 Compressors Based on the Compensation Characteristic

Arithmetic Sequences and Blocks of Powers of Two in the Collatz Array

XUAVs: Towards Efficient Approximate Computing for UAVs—Low Power Approximate Adders With Single LUT Delay for FPGA-Based Aerial Imaging Optimization

Functional Demonstration of a Memristive Arithmetic Logic Unit (MemALU) for In‐Memory Computing

Physically Aware Affinity-Driven Multiplier Implementation

Toward efficient implementation of basic balanced ternary arithmetic operations in CNFET technology

Ultra‐low‐voltage GDI‐based hybrid full adder design for area and energy‐efficient computing systems

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Arithmetic Blocks Research Articles

Related Topics

Articles published on Arithmetic Blocks

A Hardware Implementation of the PID Algorithm Using Floating-Point Arithmetic

Efficient realization of quantum balanced ternary reversible multiplier building blocks: A great step towards sustainable computing

Efficient Arithmetic Block Identification With Graph Learning and Network-Flow

Support vector machines implementation over integers modulo-M and Residue Number System

Implementation of Hardware and Energy Efficient Approximate Multiplier Architectures Using 4-2 Compressor for Images

Energy-Efficient Hardware Implementation of Fully Connected Artificial Neural Networks Using Approximate Arithmetic Blocks.

A High-Speed Low-Energy One-Trit Ternary Multiplier Circuit Design in CNTFET Technology

Process Variability Analysis in Interconnect, Logic, and Arithmetic Blocks of 16-nm FinFET FPGAs

A scalable high‐speed hybrid 1‐bit full adder design using XOR‐XNOR module

A Low Area FPGA Implementation of Reversible Gate Encryption with Heterogeneous Key Generation

Design and Analysis of Hybrid full adder Topology using Regular and Triplet Logic Design

Reconfigurable 2T2R ReRAM Architecture for Versatile Data Storage and Computing In-Memory

Cognitive and balance impairments in people with incidental white matter hyperintensities

Design of Ultra-Low Power Consumption Approximate 4–2 Compressors Based on the Compensation Characteristic

Arithmetic Sequences and Blocks of Powers of Two in the Collatz Array

XUAVs: Towards Efficient Approximate Computing for UAVs—Low Power Approximate Adders With Single LUT Delay for FPGA-Based Aerial Imaging Optimization

Functional Demonstration of a Memristive Arithmetic Logic Unit (MemALU) for In‐Memory Computing

Physically Aware Affinity-Driven Multiplier Implementation

Toward efficient implementation of basic balanced ternary arithmetic operations in CNFET technology

Ultra‐low‐voltage GDI‐based hybrid full adder design for area and energy‐efficient computing systems