Abstract
Convolutional neural networks (CNNs) have been widely deployed in computer vision and pattern recognition because of their high accuracy. However, large convolution operations are computation-intensive and often require a powerful computing platform such as a Graphics Processing Unit (GPU), which makes it difficult to apply CNNs to portable devices. State-of-the-art CNNs such as MobileNetV2 and Xception adopt depthwise separable convolution in place of standard convolution for embedded platforms, which significantly reduces operations and parameters with only a limited loss in accuracy. This highly structured model is well suited to Field-Programmable Gate Array (FPGA) implementation. In this paper, a scalable, high-performance CNN accelerator optimized for depthwise separable convolution is proposed. The accelerator can fit into FPGAs of different sizes, trading off hardware resources against processing speed. As an example, MobileNetV2 is implemented on an Arria 10 SoC FPGA, and the results show that the accelerator can classify each picture from ImageNet in 3.75 ms, about 266.6 frames per second, a roughly 20x speedup compared to the CPU.
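The abstract's claim about reduced operations comes from factoring a standard convolution into a per-channel (depthwise) filter followed by a 1x1 (pointwise) channel mix. The sketch below is an illustrative software rendering of that factorization, not the paper's hardware design; the array shapes, kernel size, and channel counts are assumptions chosen only to show the structure and the multiply-count comparison.

```python
# Minimal sketch (illustration only, not the accelerator described in the paper):
# depthwise separable convolution = depthwise k x k filtering + pointwise 1x1 mixing.
import numpy as np

def depthwise_separable_conv(x, dw_kernels, pw_kernels):
    """x: (H, W, C_in); dw_kernels: (k, k, C_in); pw_kernels: (C_in, C_out)."""
    H, W, C_in = x.shape
    k = dw_kernels.shape[0]
    pad = k // 2
    xp = np.pad(x, ((pad, pad), (pad, pad), (0, 0)))
    # Depthwise stage: each input channel is filtered by its own k x k kernel.
    dw_out = np.zeros((H, W, C_in))
    for c in range(C_in):
        for i in range(H):
            for j in range(W):
                dw_out[i, j, c] = np.sum(xp[i:i+k, j:j+k, c] * dw_kernels[:, :, c])
    # Pointwise stage: a 1x1 convolution mixes channels into C_out outputs.
    return dw_out @ pw_kernels  # shape (H, W, C_out)

# Example dimensions (assumed for illustration).
H = W = 8; C_in, C_out, k = 16, 32, 3
x = np.random.rand(H, W, C_in)
y = depthwise_separable_conv(x, np.random.rand(k, k, C_in), np.random.rand(C_in, C_out))
print(y.shape)  # (8, 8, 32)

# Multiply counts for the same output size:
standard  = k * k * C_in * C_out * H * W
separable = k * k * C_in * H * W + C_in * C_out * H * W
print(f"reduction: {standard / separable:.1f}x")  # cost ratio ~ 1/C_out + 1/k^2
```

The final ratio (about 1/C_out + 1/k^2 of the standard-convolution cost) is what makes the approach attractive for resource-constrained FPGA and embedded targets.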