Field Programmable Gate Array Device Research Articles

With rapidly developing high-speed wireless communications, the 60 GHz millimeter-wave (mm-wave) frequency range has attracted extensive interests, and radio-over-fiber (RoF) systems have been widely investigated as a promising solution to deliver mm-wave signals. Neural networks have been proposed and studied to improve the mm-wave RoF system performances at the receiver side by suppressing both linear and nonlinear impairments. However, previous studies of neural networks in mm-wave RoF systems all focus on the use of off-line processing with high-end GPUs or CPUs, which are not practical for low power-consumption, low-cost and limited computation platform applications. To solve this issue, in this paper we investigate neural network hardware accelerator implementations for mm-wave RoF systems for the first time using the field programmable gate array (FPGA), taking advantage of the low power consumption, parallel computation, and reconfigurablity features of FPGA. Both the convolutional neural network (CNN) and binary convolutional neural network (BCNN) hardware accelerators are demonstrated. In addition, to satisfy the low-latency requirement in mm-wave RoF systems and to enable the use of low-cost compact FPGA devices, a novel inner parallel computation optimization method for implementing CNN and BCNN on FPGA is proposed. It is shown that compared with the popular embedded processor (ARM Cortex A9) execution latency, the proposed FPGA-based hardware accelerator reduces the processing delay in mm-wave RoF systems by about 99.45% and 92.79% for CNN and BCNN, respectively. Compared with non-optimized FPGA implementations, results show that the proposed inner parallel computation method reduces the processing latency by about 44.93% and 45.85% for CNN and BCNN, respectively. In addition, compared with the GPU implementation, the latency of CNN implementation with the proposed optimization method is reduced by 85.49%, while the power consumption is reduced by 86.91%. Although the latency of BCNN implementation with the proposed optimization method is larger compared with the GPU implementation, the power consumption is reduced by 86.14%. The demonstrated FPGA-based neural network hardware accelerators provide a promising solution for mm-wave RoF systems.

Read full abstract

There is an enormous demand for high speed data communication or high speed internet using Long Term Evolution (LTE) or LTE-Advanced communication methods. To achieve the high speed data rate in the receiver side, it is necessary to achieve high throughput in the Fast Fourier transform (FFT) architecture. Hence there is demand to improve the throughput of the FFT architectures used for high speed data communication. The FFT architecture is designed and optimized for LTE-A applications. However, the high throughput has been achieved by sacrificing hardware resources of the Field Programmable Gate Array (FPGA). Many fixed and variable length FFT processors are proposed by the researchers with improved performance focusing either on algorithmic modifications, novel architectural optimizations or radix selection. Among these FFT processors, the goal and main objectives of this paper are: 1. To design and implement a pipelined FFT architecture to give high throughput for LTE-A MIMO applications 2. To develop an intellectual property (IP) core for FFT computation with variable FFT size 3. To propose a parallel implementation to increase the performance of LTE-A baseband processing system. In an Orthogonal Frequency Division Multiplexing (OFDM) baseband communication system, FFT operation is one of the highest computationally serious tasks which directly influence the communication system performance factor. The baseband hardware has to be capable as well as efficient enough to calculate FFT in specific timing restrictions. Thus, the parallel pipelined multi radix Variable Length FFT architectures for LTE-A Multiple-Input-Multiple-Output (MIMO) applications have been designed. The proposed FFT architecture delivers a throughput of 550 MSPS at the maximum clock rate of 550 MHz. The proposed FFT support the FFT length of 64, 128, 256, 512, 1024 and 2048 for LTE-Advanced MIMO standard. The proposed FFT architecture has been implemented in Xilinx Artix-7 FPGA device and the performance metrics have been analysed. Even though the proposed FFT architecture consumes additional Block-RAMs (BRAM) and quite an amount of Xilinx-Xtreme Digital Signal Processor (DSP)/ DSP48 resources, the Power Delay Product (PDP) of the proposed FFT is excellent compared to the existing FFT architectures. The proposed multi radix mixed 2/3/5 parallel pipelined 2048 point FFT architectures exhibits very less latency of 11.382 µs at 550 MHz clock frequency compared to the existing system with the latency of 56.88 µs at 200 MHz clock frequency. Moreover, the designed FFT architecture utilizes 96 Xilinx Xtreme DSP blocks 25040 clock cycles to complete the FFT operation with an excellent throughput of 2200 MSPS with FFT computation time of 45.528 µs at the clock frequency of 550 MHz for the 4x4 MIMO- OFDM architecture. Therefore, the designed FFT provides high throughput rate in order to meet the modern wireless standard specifications. The proposed FFT architecture outperforms well in terms high throughput, low latency and better PDP with extra hardware as trade-off for both for Single FFT and for MIMO technology.

Read full abstract

Field Programmable Gate Array Device Research Articles

Related Topics

Articles published on Field Programmable Gate Array Device

A fast and scalable architecture to run convolutional neural networks in low density FPGAs

Area and power‐efficient variable‐length fast Fourier transform for MR‐OFDM physical layer of IEEE 802.15.4‐g

FPGA-based neural network accelerators for millimeter-wave radio-over-fiber systems.

A One-Cycle Correction Error-Resilient Flip-Flop for Variation-Tolerant Designs on an FPGA

Efficient Implementation of Multiple Time Coding Lines-Based TDC in an FPGA Device

Dual-Mode FPGA-Based Triple-TDC With Real-Time Calibration and a Triple Modular Redundancy Scheme

A Novel High-Precision Synchronous Sampling Mechanism for Distributed Test System Based on Optical Fiber Network and Low-Cost Field-Programmable Gate Array

PARALLEL PIPELINED MULTI RADIX VARIABLE LENGTH FAST FOURIER TRANSFORM ARCHITECTURE

Dynamic partial reconfiguration enchanced with security system for reduced area and low power consumption

Parallel architecture of power‐of‐two multipliers for FPGAs

Low-Complexity Nonlinear Self-Inverse Permutation for Creating Physically Clone-Resistant Identities

Advanced superior execution time optimal time-frequency filter suitable for non-linear FM signals estimation

Dynamic Bus Voltage Reconfiguration in a Two-Stage Multiphase Converter for Fast Transient

Towards cloud energy metering system with 32 bit FPGA device architecture

Bio-Inspired Approaches to Safety and Security in IoT-Enabled Cyber-Physical Systems.

Spin Me Right Round Rotational Symmetry for FPGA-Specific AES: Extended Version

DSP-Efficient Hardware Acceleration of Convolutional Neural Network Inference on FPGAs

Measurement of the PMSM Current with a Current Transducer with DSP and FPGA

FPGA-Based Lightweight Hardware Architecture of the PHOTON Hash Function for IoT Edge Devices

A hybrid bio-inspired optimisation approach for wirelength minimisation of hardware tasks placement in field programmable gate array devices

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Field Programmable Gate Array Device Research Articles

Related Topics

Articles published on Field Programmable Gate Array Device

A fast and scalable architecture to run convolutional neural networks in low density FPGAs

Area and power‐efficient variable‐length fast Fourier transform for MR‐OFDM physical layer of IEEE 802.15.4‐g

FPGA-based neural network accelerators for millimeter-wave radio-over-fiber systems.

A One-Cycle Correction Error-Resilient Flip-Flop for Variation-Tolerant Designs on an FPGA

Efficient Implementation of Multiple Time Coding Lines-Based TDC in an FPGA Device

Dual-Mode FPGA-Based Triple-TDC With Real-Time Calibration and a Triple Modular Redundancy Scheme

A Novel High-Precision Synchronous Sampling Mechanism for Distributed Test System Based on Optical Fiber Network and Low-Cost Field-Programmable Gate Array

PARALLEL PIPELINED MULTI RADIX VARIABLE LENGTH FAST FOURIER TRANSFORM ARCHITECTURE

Dynamic partial reconfiguration enchanced with security system for reduced area and low power consumption

Parallel architecture of power‐of‐two multipliers for FPGAs

Low-Complexity Nonlinear Self-Inverse Permutation for Creating Physically Clone-Resistant Identities

Advanced superior execution time optimal time-frequency filter suitable for non-linear FM signals estimation

Dynamic Bus Voltage Reconfiguration in a Two-Stage Multiphase Converter for Fast Transient

Towards cloud energy metering system with 32 bit FPGA device architecture

Bio-Inspired Approaches to Safety and Security in IoT-Enabled Cyber-Physical Systems.

Spin Me Right Round Rotational Symmetry for FPGA-Specific AES: Extended Version

DSP-Efficient Hardware Acceleration of Convolutional Neural Network Inference on FPGAs

Measurement of the PMSM Current with a Current Transducer with DSP and FPGA

FPGA-Based Lightweight Hardware Architecture of the PHOTON Hash Function for IoT Edge Devices

A hybrid bio-inspired optimisation approach for wirelength minimisation of hardware tasks placement in field programmable gate array devices