Design and Application Space Exploration of a Domain-Specific Accelerator System

Fan Feng,Hongbing Pan,Guoqiang He,Li Li,Yuxiang Fu,Kun Wang

doi:10.3390/electronics7040045

Fan Feng, Hongbing Pan + Show 4 more

Open Access

https://doi.org/10.3390/electronics7040045

Copy DOI

Journal: Electronics	Publication Date: Mar 29, 2018
Citations: 3	License type: CC BY 4.0

Affiliation: Nanjing University, Institute of Electronics

Abstract

Domain-specific accelerators are a reaction adapting to device scaling and the dark silicon era. This paper describes a radar signal processing oriented configurable accelerator and the application space exploration of the system. The system is built around accelerator engines and general-purpose processors (GPPs) that make it suitable for intensive computing kernel acceleration and complex control tasks. It is geared toward high-performance radar digital signal processing; we characterize the applications and find that each of them contains a series of serializable kernels. Taking advantage of this discovery, we design an algorithm pool that shares the same computation resource and memory resource, and each algorithm is size reconfigurable. On the other hand, shared on-chip addressable scratchpad memory eliminates unnecessary explicit data copy between accelerators. Performance of the system is evaluated from measurements performed both on an FPGA SoC test chip and on a prototype chip fabricated by CMOS 40 nm technology. The experimental results show that for different algorithms, the proposed system achieves 1.9× to 10.1× performance gain compared with a state-of-the-art TI DSP chip. In order to characterize the application of the system, a complex real-life task is adopted, and the results show that it can obtain high throughput and desirable precision.

Highlights

The ever-increasing computational requirements in modern digital signal processing force the systems to obtain higher processing capacity and higher throughput simultaneously.Various architecture alternatives are capable of performing these tasks.Due to their high volume, powerful general purpose processors are a straightforward way to accelerate digital signal processing, but such processors must consume a large fraction of transistors to flexibly support diverse application domains, can often be inefficient for specific workloads.On the other hand, in the era of diminishing returns from technology scaling, fixed power budgets and the end of Dennard scaling [1,2] have led researchers to embrace accelerators in order to sustain performance and energy efficiency increases
All the test data of reconfigurable application specified processor (RASP) is obtained by running real-life tasks in the prototype chip
As for the TI DSP, which has similar computation resources compared with RASP, the RASP can achieve 2.13x speedup

Summary

Introduction

The ever-increasing computational requirements in modern digital signal processing force the systems to obtain higher processing capacity and higher throughput simultaneously. The other class of customized accelerators, implemented as ASICs demonstrated substantial advantages in performance and power over the instruction driven general-purpose processors, and they can perform key kernel computations to achieve orders of magnitude of improvements in performance and energy efficiency. They are integral components in modern SoCs, powering a variety of applications like video codec [3], data processing [4,5], and deep learning [6]. A conclusion of the paper is presented in the last section

System Architecture Overview

Workload Analysis

Compilation Workflow

Result data outputting

Accelerator Gain Analysis

Data Interface Latency Analysis

Algorithm Speedup Ratio

Comprehensive Analysis

Test Case Application

Prototype Verification Environment

Results

Performance Analysis

Precision Analysis

Conclusions

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Design and Application Space Exploration of a Domain-Specific Accelerator System

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Electronics

Lead the way for us

Similar Papers

Instrumentation for Wide Bandwidth Radio Astronomy

-

01 Jan 2009
01 Jan 2009

PN-generators embedded in high performance signal processors
U Walther ... G.P Ferrweis
-
U Walther, et. al.U Walther ... G.P Ferrweis
06 May 2001
06 May 2001

Optimizing Data Placement of Loops for Energy Minimization with Multiple Types of Memories
Jun Zhang ... Edwin H.-M Sha
Journal of Signal Processing Systems | VOL. 72
Jun Zhang, et. al.Jun Zhang ... Edwin H.-M Sha
28 Jun 2013
Journal of Signal Processing Systems | VOL. 72

Radar Signal Analysis and Processing Using MATLAB
Bassem R Mahafza
-
Bassem R MahafzaBassem R Mahafza
19 Apr 2016
19 Apr 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Design and Application Space Exploration of a Domain-Specific Accelerator System

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Electronics