Abstract

Convolutional neural networks (CNNs) have proven to be an effective method in the field of artificial intelligence (AI), and deploying CNNs to embedded devices at scale will undoubtedly promote the development and application of AI in practical industry. However, mainly due to the space-time complexity of CNNs, computing power, memory bandwidth, and flexibility are performance bottlenecks. In this paper, a framework combining model compression and hardware acceleration is proposed to solve these problems. The framework consists of a mixed pruning method, data storage optimization for efficient memory utilization, and an accelerator for mapping CNNs onto a field-programmable gate array (FPGA). The mixed pruning method compresses the model, and data quantization reduces the data bit-width to 8 bits. The FPGA-based accelerator makes CNN implementation flexible, configurable, and efficient. The model compression is evaluated on an NVIDIA RTX 2080 Ti; the results show that VGG16 is compressed by 30× and the fully convolutional network (FCN) by 11× within 1% accuracy loss. The compressed model is deployed and accelerated on a ZCU102, achieving up to 1.7× and 24.5× better energy efficiency than the RTX 2080 Ti and the Intel i7-7700, respectively.
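
To illustrate the 8-bit quantization step mentioned above, the sketch below shows symmetric per-tensor quantization of floating-point weights to int8. The abstract does not specify the exact quantization scheme used in the paper, so the scaling rule here (mapping the maximum absolute weight to 127) and the function name are assumptions for illustration only.

```python
import numpy as np

def quantize_int8(weights: np.ndarray):
    """Symmetric per-tensor quantization of float weights to int8.

    Returns the int8 tensor and the scale needed to dequantize:
    weights ≈ q.astype(np.float32) * scale
    (Illustrative sketch; the paper's actual scheme may differ.)
    """
    # Choose the scale so the largest-magnitude weight maps to the int8 range.
    max_abs = np.max(np.abs(weights))
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    q = np.clip(np.round(weights / scale), -128, 127).astype(np.int8)
    return q, scale

# Usage: quantize a random 64x3x3x3 convolution kernel and check the error.
w = np.random.randn(64, 3, 3, 3).astype(np.float32)
q, scale = quantize_int8(w)
w_hat = q.astype(np.float32) * scale
print("max quantization error:", np.max(np.abs(w - w_hat)))
```

Reducing weights and activations from 32-bit floats to 8-bit integers cuts memory traffic by roughly 4× and lets the FPGA accelerator use narrow integer multipliers, which is what makes the data storage optimization and energy-efficiency gains reported above possible.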
