Towards a Uniform Template-based Architecture for Accelerating 2D and 3D CNNs on FPGA

Junzhong Shen,Chunyuan Zhang,Yuran Qiao,You Huang,Zelong Wang,Mei Wen

doi:10.1145/3174243.3174257

Abstract

Three-dimensional convolutional neural networks (3D CNNs) are used efficiently in many computer vision applications. Most previous work in this area has concentrated only on designing and optimizing accelerators for 2D CNN, with few attempts made to accelerate 3D CNN on FPGA. We find accelerating 3D CNNs on FPGA to be challenge due to their high computational complexity and storage demands. More importantly, although the computation patterns of 2D and 3D CNNs are analogous, the conventional approaches adopted for accelerating 2D CNNs may be unfit for 3D CNN acceleration. In this paper, in order to accelerate 2D and 3D CNNs using a uniform framework, we propose a uniform template-based architecture that uses templates based on the Winograd algorithm to ensure fast development of 2D and 3D CNN accelerators. Furthermore, we also develop a uniform analytical model to facilitate efficient design space explorations of 2D and 3D CNN accelerators based on our architecture. Finally, we demonstrate the effectiveness of the template-based architecture by implementing accelerators for real-life 2D and 3D CNNs (VGG16 and C3D) on multiple FPGA platforms. On S2C VUS440, we achieve up to 1.13 TOPS and 1.11 TOPS under low resource utilization for VGG16 and C3D, respectively. End-to-end comparisons with CPU and GPU solutions demonstrate that our implementation of C3D achieves gains of up to 13x and 60x in performance and energy relative to a CPU solution, and a 6.4x energy efficiency gain over a GPU solution.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Towards a Uniform Template-based Architecture for Accelerating 2D and 3D CNNs on FPGA

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

A Uniform Architecture Design for Accelerating 2D and 3D CNNs on FPGAs
Zhiqiang Liu ... Yong Dou
Electronics | VOL. 8
Zhiqiang Liu, et. al.Zhiqiang Liu ... Yong Dou
07 Jan 2019
Electronics | VOL. 8

F-E3D: FPGA-based Acceleration of an Efficient 3D Convolutional Neural Network for Human Action Recognition
Hongxiang Fan ... Shuanglong Liu
-
Hongxiang Fan, et. al.Hongxiang Fan ... Shuanglong Liu
01 Jul 2019
01 Jul 2019

Enhancing Brain Tumor Classification with a Novel Three-Dimensional Convolutional Neural Network (3D-CNN) Fusion Model
Maryam I Mousa Al-Khuzaie ... Waleed A Mahmoud Al-Jawher
Journal Port Science Research | VOL. 7
Maryam I Mousa Al-Khuzaie, et. al.Maryam I Mousa Al-Khuzaie ... Waleed A Mahmoud Al-Jawher
02 Aug 2024
Journal Port Science Research | VOL. 7

Accelerating 3D Convolutional Neural Networks Using 3D Fast Fourier Transform
Chao Fang ... Jinghe Wei
-
Chao Fang, et. al.Chao Fang ... Jinghe Wei
01 May 2021
01 May 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Towards a Uniform Template-based Architecture for Accelerating 2D and 3D CNNs on FPGA

Abstract

Talk to us

Similar Papers