Abstract

Multiplication is the dominant operation for many applications implemented on field-programmable gate arrays (FPGAs). Although most current FPGA families have embedded hard multipliers, soft multipliers using lookup tables (LUTs) in the logic fabric remain important. This paper presents a novel two-operand addition circuit (patent pending) that combines radix-4 partial-product generation with addition and shows how it can be used to implement two’s-complement array multipliers. The circuit is specific to modern Xilinx FPGAs that are based on a 6-input LUT architecture. Proposed pipelined multipliers use 42%–52% fewer LUTs, and some versions can be clocked up to 23% faster than delay-optimized LogiCORE IP multipliers. This allows 1.72–2.10-times as many multipliers to be implemented in the same logic fabric and potentially offers 1.86–2.58-times the throughput by increasing the clock frequency.

Highlights

  • Field-programmable gate arrays (FPGAs) are often used in signal processing systems for many applications, such as digital-signal processing (DSP), video processing and image processing

  • These versions use 47%–52% fewer lookup tables (LUTs) than the baseline LogiCORE multipliers, which allows 1.90–2.10-times as many to be implemented in the same logic fabric

  • This paper presents a novel two-operand adder that combines radix-4 partial-product generation and addition and shows how it can be used in FPGAs based on 6-input LUTs to implement two’s-complement array multipliers

Read more

Summary

Introduction

Field-programmable gate arrays (FPGAs) are often used in signal processing systems for many applications, such as digital-signal processing (DSP), video processing and image processing. Parandeh-Afshar and Ienne discuss the importance of this topic and present techniques to improve the performance of soft multipliers in Altera FPGAs [7]. They present radix-2 Baugh–Wooley multipliers and radix-4-modified Booth multipliers that use generalized parallel counters (GPCs) in the logic fabric to reduce the partial-product matrix to two or three rows, which are added using a carry-propagate adder (CPA). This builds on the previous work of Parandeh-Afshar et al.

Xilinx Logic Fabric
Two-Operand Addition
Altera Logic Fabric
Radix-4-Modified Booth Multipliers
Related Work
Proposed Two-Operand Adder
Partial-Product Selection and Generation
Combined Partial-Product Generation and Addition
Optimizations for the Generate-Add Unit
Array Structure and Pipelining
Row 0 Generate-Add Estimation Unit
Results
Methodology
Single-Cycle Multipliers
Pipelined Multipliers
Layout
GPC-Based Tree Multipliers
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call