Digit-recurrence Algorithm Research Articles

Most digit-recurrence algorithms for division, such as the Sweeney–Robertson–Tocher (SRT) algorithm, take advantage of redundant representations of the partial remainder. This avoids full carry-propagate additions and yields significant latency improvements. Furthermore, the delay of one division iteration is independent of the size of the operands. The most common redundant form for the partial remainder is the carry-save (CS) representation, which uses 2 bits (a carry bit and a sum bit) for each bit of the partial remainder. This paper proposes radix-4 SRT dividers which use (3, 2) redundancy (3 bits of representation for 2 bits of the partial remainder) and (5, 4) redundancy (5 bits of representation for 4 bits of the partial remainder). The goal of these representations is to lower cost by reducing the number of sequential elements required to store the partial remainder. The proposed dividers use 2-bit and 4-bit carry-propagate adders to compute the new partial remainder. Thus, full carry-propagate addition is still avoided, and the latency of one division iteration remains independent of the operands' size. The synthesis results for Xilinx Virtex-5 FPGA devices show that dividers using the proposed redundant representations reach working frequencies similar to those of the conventional carry-save design, while requiring up to 12% fewer sequential elements for the (3, 2) representation and 18% fewer for the (5, 4) representation.
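To make the carry-save idea in the abstract above concrete, here is a minimal Python sketch of a 3:2 carry-save addition. It is only an illustration of the conventional CS form (two stored bits per remainder bit), not the (3, 2) or (5, 4) schemes the paper proposes; the function names `carry_save_add` and `resolve` and the `width` parameter are assumptions made for this example.

```python
def carry_save_add(s: int, c: int, x: int, width: int) -> tuple[int, int]:
    """Compress three width-bit words into a (sum, carry) pair.

    Each output bit depends only on the three input bits in the same
    column, so no carry ripples across the word. Results are taken
    modulo 2**width, as in a fixed-width datapath.
    """
    mask = (1 << width) - 1
    new_sum = (s ^ c ^ x) & mask                            # per-column parity
    new_carry = (((s & c) | (s & x) | (c & x)) << 1) & mask  # per-column majority, shifted left
    return new_sum, new_carry


def resolve(s: int, c: int, width: int) -> int:
    """Convert a carry-save pair to ordinary binary.

    This is the one full carry-propagate addition, normally deferred
    until the value is needed in non-redundant form.
    """
    return (s + c) & ((1 << width) - 1)


# Hypothetical usage: accumulate 11 + 6 + 5 without propagating carries.
s, c = carry_save_add(0b1011, 0b0110, 0b0101, width=8)
print(resolve(s, c, width=8))  # prints 22
```

The cost the abstract refers to is visible here: the remainder is held as two full-width words (`s` and `c`), which is what the proposed representations aim to reduce.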

Given the popularity of decimal arithmetic, hardware implementation of decimal operations has been an active research topic in recent decades. Besides the four basic operations, the square root can be implemented as an instruction directly in hardware, which improves the performance of the decimal floating-point unit in processors. Hardware decimal square rooters are usually built on either functional or digit-recurrence algorithms. Functional algorithms, which require a multiplication per iteration, seem ill suited to decimal square roots, given the high cost of decimal multipliers. Digit-recurrence square root algorithms, on the other hand, and particularly SRT algorithms (named after their creators, Sweeney, Robertson, and Tocher), are simple and well suited to decimal arithmetic. With the intention of reducing the latency of the decimal square root operation while maintaining a reasonable cost, this paper proposes an SRT algorithm and the corresponding hardware architecture to compute the decimal square root. The proposed fixed-point square root design requires n+3 cycles to compute an n-digit root; the synthesis results show an area cost of about 31K NAND2 and a cycle time of 40 FO4. These results reveal a 14% speed advantage for the proposed decimal square root architecture over the fastest previous work (which uses a functional algorithm), at about a quarter of the area.
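The one-digit-per-iteration structure behind the n+3 cycle count can be sketched in software. The Python below is the classic restoring, non-redundant decimal digit-by-digit square root, shown only to illustrate the recurrence; it is not the SRT algorithm of the paper, which selects digits from a redundant set using estimates rather than exact comparisons. The function name `decimal_sqrt_digits` and the `n_frac` parameter are assumptions made for this example.

```python
def decimal_sqrt_digits(x: int, n_frac: int = 0) -> tuple[int, int]:
    """Digit-by-digit decimal square root.

    Returns (root, rem) such that root**2 + rem == x * 10**(2 * n_frac)
    and root is the floor of the square root of that scaled value.
    One root digit is produced per loop iteration.
    """
    if x < 0:
        raise ValueError("x must be non-negative")
    digits = str(x * 10 ** (2 * n_frac))
    if len(digits) % 2:
        digits = "0" + digits              # pad so digits group into pairs
    root, rem = 0, 0
    for i in range(0, len(digits), 2):
        # Shift the remainder by one root digit and bring down the next pair.
        rem = rem * 100 + int(digits[i:i + 2])
        # Pick the largest digit d with (20*root + d) * d <= rem.
        d = 9
        while (20 * root + d) * d > rem:
            d -= 1
        rem -= (20 * root + d) * d         # update the partial remainder
        root = 10 * root + d               # append the new root digit
    return root, rem


# Hypothetical usage: sqrt(2) to three fractional decimal digits.
print(decimal_sqrt_digits(2, n_frac=3))  # prints (1414, 604)
```

The exact comparison inside the `while` loop is the part a hardware SRT design replaces with a cheaper digit-selection function over a truncated remainder.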

Articles published on Digit-recurrence Algorithm

Radix-64 Floating-Point Division and Square Root: Iterative and Pipelined Units

Low Latency Floating-Point Division and Square Root Unit

On the Redundant Representation of Partial Remainders in Radix-4 SRT Dividers

Hardware Implementation of Methodologies of Fixed Point Division Algorithms

Decimal Division Algorithms: The Issue of Partial Remainders

Decimal SRT Square Root: Algorithm and Architecture

Analysis of Fast Radix-10 Digit Recurrence Algorithms for Fixed-Point and Floating-Point Dividers on FPGAs

FPGA based High Speed Double Precision Floating Point Divider

Decimal floating-point antilogarithmic converter based on selection by rounding: algorithm and architecture

A Radix-16 Combined Complex Division/Square Root Unit with Operand Prescaling

Power Efficient Division and Square Root Unit

Improved Decimal Floating-Point Logarithmic Converter Based on Selection by Rounding

Design Issues and Implementations for Floating-Point Divide–Add Fused

A Radix-2 Digit-by-Digit Architecture for Cube Root

A Digit-by-Digit Algorithm for mth Root Extraction

A Radix-10 Digit-Recurrence Division Unit: Algorithm and Architecture

Complex Square Root with Operand Prescaling

Hardware Algorithm for Computing Reciprocal of Euclidean Norm of a 3-D Vector

High-Radix Logarithm with Selection by Rounding: Algorithm and Implementation

Implementation of the Exponential Function in a Floating-Point Unit
