Abstract

This paper proposes a parameterized digital signal processor (DSP) core for an embedded digital signal processing system designed to achieve demodulation/synchronization with better performance and flexibility. The features of this DSP core include parameterizedb data path, dual MAC unit, subword MAC, and optional function-specific blocks for accelerating communication system modulation operations. This DSP core also has a low-power structure, which includes the gray-code addressing mode, pipeline sharing, and advanced hardware looping. Users can select the parameters and special functional blocks based on the character of their applications and then generating a DSP core. The DSP core has been implemented via a cell-based design method using a synthesizable Verilog code with TSMC 0.35 µm SPQM and 0.25 µm 1P5M library. The equivalent gate count of the core area without memory is approximately 50 k. Moreover, the maximum operating frequency of a 16 × 16 version is 100 MHz (0.35 µm) and 140 MHz (0.25 µm).

Highlights

  • During the past few years, digital signal processor (DSP) has become the fastest growing segment in the processor industry [1]

  • The data transfer in the handshaking and merge modes occurs between the data outside the NCU DSP core and the host programmable interface (HPI) memory

  • Regarding the data-flow consideration, a delay register is added between the single MAC path and the second MAC path to create the data source

Read more

Summary

INTRODUCTION

During the past few years, digital signal processor (DSP) has become the fastest growing segment in the processor industry [1]. Very long instruction word (VLIW) and single instruction multiple data (SIMD) approaches can be used to further enhance processor performance [3]. These approaches are not economical for dedicated application in area and power terms. This DSP core supports operations such as scaling, digital FIR filtering (both fixed-coefficient filter for pulse shaping and adaptive filter for equalization), symbol slicing, looping, complex multiplication, and so on.

ARCHITECTURE OF THE DSP CORE
Bus and memory architecture
Pipeline stage
Dual MAC architecture
Subword MAC
A Ndata B Ndata
LOW-POWER DESIGN
Gray-code addressing
Advanced hardware looping
Pipeline sharing
FIR filter function example
Chip verification
CONCLUSIONS
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call