Custom Accelerators Research Articles

With the increasing complexity of models and the limited flexibility of analog computers, coupled with advances in digital hardware, the simulation industry has moved to digital computers and to greater use of programming languages such as C, C++, and MATLAB. However, the reduced time-step required to simulate complex and fast systems imposes a tighter constraint on the time within which the computations have to be performed. Sequential execution of these computations cannot meet the real-time constraints, which further restricts the usefulness of Real-Time Simulation (RTS) in a Virtual Reality (VR) environment. In this paper, we present a methodology for the design and implementation of RTS algorithms based on Field-Programmable Gate Array (FPGA) technology. We apply our methodology to an 8th-order steering valve subsystem of a vehicle with relatively low response time requirements and use FPGA technology to improve the response time of this model. Our methodology uses traditional hardware/software co-design approaches to generate a heterogeneous architecture for an FPGA-based simulator by porting the computationally complex regions to hardware. The hardware design was optimized to exploit the parallel nature of FPGAs and to pipeline the independent operations. A further enhancement was made by building a hardware component library of custom accelerators for common non-linear functions. The library also stores each component's resource utilization, cycle count, and relative error for different bit-width combinations, and this information is used to evaluate different partitioning approaches. In this paper, we illustrate the partitioning of a hardware-based simulator design across dual FPGAs, initiate RTS using a system input from a Hardware-in-the-Loop (HIL) framework, and use the simulation results from our FPGA-based platform to perform response analysis. The total simulation time, which includes the time required to receive the system input over a socket (without HIL), software initialization, hardware computation, and transfer of simulation results back over a socket, shows a speedup of 2× compared with a similar setup with no hardware acceleration. The correctness of the simulation output from the hardware has also been validated against the simulated results from the software-only design.
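
The trade-off between bit-width and relative error that the component library records can be illustrated in software. The sketch below is not taken from the paper: it assumes a signed fixed-point format, uses `math.sin` as a stand-in for an unspecified non-linear function, and all function names are hypothetical. Resource utilization and cycle count come from hardware synthesis and cannot be reproduced this way.

```python
import math

def to_fixed(x: float, frac_bits: int) -> int:
    """Round x to the nearest fixed-point integer with frac_bits fractional bits."""
    return round(x * (1 << frac_bits))

def from_fixed(q: int, frac_bits: int) -> float:
    """Convert a fixed-point integer back to a float."""
    return q / (1 << frac_bits)

def max_relative_error(func, inputs, frac_bits: int) -> float:
    """Worst relative error when both input and output are quantized (assumed model)."""
    worst = 0.0
    for x in inputs:
        exact = func(x)
        if exact == 0.0:
            continue  # relative error is undefined at zero
        x_q = from_fixed(to_fixed(x, frac_bits), frac_bits)
        y_q = from_fixed(to_fixed(func(x_q), frac_bits), frac_bits)
        worst = max(worst, abs(y_q - exact) / abs(exact))
    return worst

# Hypothetical sweep: sample points in [0.1, 1.59] and three candidate widths.
samples = [0.1 + 0.01 * i for i in range(150)]
error_by_width = {
    bits: max_relative_error(math.sin, samples, bits) for bits in (8, 12, 16)
}

for bits, err in error_by_width.items():
    print(f"{bits} fractional bits: max relative error {err:.2e}")
```

A table of this kind, extended with synthesis results for each width, is the sort of record a partitioning step could consult when choosing the narrowest bit-width that still meets an accuracy target.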

SUMMARY Heterogeneous multi-core architectures with CPUs and accelerators attract considerable attention because they can achieve power-efficient computing in areas ranging from low-power embedded processing to high-performance computing. Since the optimal architecture differs from application to application, finding the most suitable accelerator is very important. In this paper, we propose an FPGA-based heterogeneous multi-core platform with custom accelerators for power-efficient computing. Using the proposed platform, we evaluate several applications and accelerators to identify key requirements of the applications and properties of the accelerators. Such an evaluation is very important for selecting and optimizing the most suitable accelerator according to the requirements of an application to achieve the best performance.

key words: heterogeneous multicore processor, FPGA, multimedia processing, high-performance computing

1. Introduction

Applications ranging from low-power embedded processing to high-performance computing have different tasks, such as data-intensive tasks and control-intensive tasks. Therefore, the optimal architecture differs from application to application. Heterogeneous multicore processing is proposed to execute applications power-efficiently. It uses different processor cores, such as CPU cores and accelerator cores, as shown in Fig. 1. If the tasks of an application are correctly allocated to the most suitable processor cores, all the cores work together to increase the overall performance.

Examples of low-power heterogeneous multi-core processors are [1] and [2]. The former has multiple cores of CPUs and ALU arrays. The latter has multiple cores of CPUs, a micro-controller, and SIMD (single-instruction multiple-data) type processors. An example of heterogeneous high-performance computing is “Tianhe-1A” [3], which has Intel X5670 CPUs and NVIDIA GPUs. Commercially available heterogeneous multicore processors are partially programmable, so that a part of the data path and the computations of the processing elements (PEs) can be changed to some extent. However, due to the wide variety of tasks and their different memory requirements, the programmability of commercially available processors is not enough to extract sufficient performance. Moreover, the programming environments in various heterogeneous architectures such as

Articles published on Custom Accelerators

A Parallel Connected Component Labeling Architecture for Heterogeneous Systems-on-Chip

Fast and Efficient Convolutional Accelerator for Edge Computing

A framework to generate domain-specific manycore architectures from dataflow programs

Real-time audio signal processing using system-on-chip field programmable gate arrays

Human Lumbar Spine Responses from Vertical Loading: Ranking of Forces Via Brier Score Metrics and Injury Risk Curves.

PDES-A

KNN-STUFF: kNN STreaming Unit for Fpgas

Accelerating Board Games Through Hardware/Software Codesign

IO and data management for infrastructure as a service FPGA accelerators

Abstract WP125: Accelerated 3d Isotropic High-resolution Multi-contrast Intracranial Vessel Wall MRI For Large Artery Stroke Evaluation

A Custom Accelerator for Homomorphic Encryption Applications

Constructive Synthesis of Memory-Intensive Accelerators for FPGA From Nested Loop Kernels

Streaming Elements for FPGA Signal and Image Processing Accelerators

TBES

Real-time simulation of dynamic vehicle models using a high-performance reconfigurable platform

Data-Transfer-Aware Design of an FPGA-Based Heterogeneous Multicore Platform with Custom Accelerators

Comparison of High Level FPGA Hardware Design for Solving Tri-diagonal Linear Systems

Evaluation of an FPGA-Based Heterogeneous Multicore Platform with SIMD/MIMD Custom Accelerators

Real-time Simulation of Dynamic Vehicle Models using a High-performance Reconfigurable Platform

Exploiting Heterogeneity for Energy Efficiency in Chip Multiprocessors
