Design of Fully Spectral CNNs for Efficient FPGA-Based Acceleration.

Shuanglong Liu,Hongxiang Fan,Wayne Luk

doi:10.1109/tnnls.2022.3224779

Abstract

Computing convolutional layers in the frequency domain using fast Fourier transformation (FFT) has been demonstrated to be effective in reducing the computational complexity of convolutional neural networks (CNNs). Nevertheless, the main challenge of this approach lies in the frequent and repeated transformations between the spatial and frequency domains due to the absence of nonlinear functions in the spectral domain, as such it makes the benefit less attractive for low-latency inference, especially on embedded platforms. To overcome the drawbacks in the existing FFT-based convolution, we propose a fully spectral CNN using a novel spectral-domain adaptive rectified linear unit (ReLU) layer, which completely removes the compute-intensive transformations between the spatial and frequency domains within the network. The proposed fully spectral CNNs maintain the nonlinearity of the spatial CNNs while taking into account the hardware efficiency. We then propose a deeply customized and compute-efficient hardware architecture to accelerate the fully spectral CNN inference on field programmable gate array (FPGA). Different hardware optimizations, such as spectral-domain intralayer and interlayer pipeline techniques, are introduced to further improve the performance of throughput. To achieve a load-balanced pipeline, a design space exploration (DSE) framework is proposed to optimize the resource allocation between hardware modules according to the resource constraints. On an Intel's Arria 10 SX160 FPGA, our optimized accelerator achieves a throughput of 204 Gop/s with 80% of compute efficiency. Compared with the state-of-the-art spatial and FFT-based implementations on the same device, our accelerator is 4×∼ 6.6× and 3.0×∼ 4.4× faster while maintaining a similar level of accuracy across different benchmark datasets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Design of Fully Spectral CNNs for Efficient FPGA-Based Acceleration.

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on neural networks and learning systems

Lead the way for us

Journal: IEEE transactions on neural networks and learning systems	Publication Date: Jun 1, 2024
Citations: 3

Similar Papers

Accelerating Fully Spectral CNNs with Adaptive Activation Functions on FPGA
Shuanglong Liu ... Hongxiang Fan
-
Shuanglong Liu, et. al.Shuanglong Liu ... Hongxiang Fan
01 Feb 2021
01 Feb 2021

DASH: Design Automation for Synthesis and Hardware Generation for CNN
Arish Sateesan ... Smitha K G
-
Arish Sateesan, et. al.Arish Sateesan ... Smitha K G
01 Dec 2020
01 Dec 2020

Evaluating Fast Algorithms for Convolutional Neural Networks on FPGAs
Yun Liang ... Shengen Yan
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems | VOL. 39
Yun Liang, et. al.Yun Liang ... Shengen Yan
21 Feb 2019
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems | VOL. 39

Leveraging Fine-grained Structured Sparsity for CNN Inference on Systolic Array Architectures
Linqiao Liu ... Stephen Brown
-
Linqiao Liu, et. al.Linqiao Liu ... Stephen Brown
01 Aug 2021
01 Aug 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Design of Fully Spectral CNNs for Efficient FPGA-Based Acceleration.

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on neural networks and learning systems