Abstract

Three-dimensional (3D) deconvolution is widely used in many computer vision applications. However, most previous works have focused only on accelerating two-dimensional (2D) deconvolutional neural networks (DCNNs) on Field-Programmable Gate Arrays (FPGAs), while the acceleration of 3D DCNNs has not been studied in depth because they have higher computational complexity and sparsity than 2D DCNNs. In this paper, we focus on accelerating both 2D and 3D sparse DCNNs on FPGAs by proposing efficient schemes for mapping them onto a uniform architecture. First, a pruning method removes unimportant network connections and increases the sparsity of the weights; after pruning, the number of parameters is reduced significantly without accuracy loss. Second, the remaining non-zero weights are encoded in coordinate (COO) format, reducing the memory demands of the parameters. Finally, to demonstrate the effectiveness of our work, we implement our accelerator design on the Xilinx VC709 evaluation platform for four real-life 2D and 3D DCNNs. After the first two steps, the storage required by the DCNNs is reduced by up to 3.9×. Results show that our accelerator outperforms our prior work by 2.5× to 3.6× in latency.

Highlights

  • Deconvolution has become widely used in the fields of computer vision, such as semantic segmentation [1], generative models [2], and high-resolution imaging [3]

  • A pruning algorithm [12] is creatively applied on deconvolutional neural networks (DCNNs) to remove low-weight connections and the remaining non-zero weights are encoded in coordinate (COO) format, which can significantly reduce the size of DCNNs

  • We evaluate our design using the same four DCNN models as our prior work: DCGAN, GP-generative adversarial networks (GANs), 3D-GAN and V-Net
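As a minimal illustration of the magnitude-based pruning step described in the highlights (a sketch only, not the paper's actual algorithm [12] or its FPGA implementation; the threshold-selection rule here is an assumption for demonstration):

```python
import numpy as np

def magnitude_prune(weights, sparsity):
    """Zero out the smallest-magnitude weights until the target sparsity is reached."""
    flat = np.abs(weights).ravel()
    k = int(sparsity * flat.size)      # number of weights to remove
    if k == 0:
        return weights.copy()
    # Threshold = k-th smallest magnitude; everything at or below it is pruned.
    threshold = np.partition(flat, k - 1)[k - 1]
    pruned = weights.copy()
    pruned[np.abs(pruned) <= threshold] = 0.0
    return pruned

# Toy 2x3 weight matrix; pruning 50% removes the three smallest-magnitude entries.
w = np.array([[0.8, -0.05, 0.3],
              [0.01, -0.9, 0.02]])
pw = magnitude_prune(w, 0.5)
```

In practice, pruning is typically followed by fine-tuning to recover any accuracy lost by removing connections, which is why the abstract can report no accuracy loss after the parameter reduction.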



Introduction

Deconvolution has become widely used in the fields of computer vision, such as semantic segmentation [1], generative models [2], and high-resolution imaging [3]. By exploiting the sparsity of input activations, our previous work significantly reduces the number of useless multiplication operations caused by inserting zeros, thereby achieving high throughput and energy efficiency. Building on the uniform architecture previously proposed in [11], we further exploit the sparsity of weights to eliminate another class of useless multiplications: those involving zero-valued weights. A pruning algorithm [12] is applied to DCNNs to remove low-weight connections, and the remaining non-zero weights are encoded in coordinate (COO) format, which significantly reduces the size of the DCNNs. We propose an efficient scheme for mapping 2D and 3D sparse DCNNs onto the uniform architecture, which improves the parallelism and computational efficiency of the accelerator.
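The COO encoding and the way it lets a compute kernel skip zero-valued weights can be sketched as follows (a NumPy illustration under assumed data layouts, not the paper's hardware storage format, which would also quantize indices and values):

```python
import numpy as np

def to_coo(weights):
    """Encode non-zero weights as (row, col, value) triples: COO format.

    Only non-zero entries are stored, so memory use scales with the
    number of surviving weights after pruning, not the dense size.
    """
    rows, cols = np.nonzero(weights)
    vals = weights[rows, cols]
    return list(zip(rows.tolist(), cols.tolist(), vals.tolist()))

def coo_matvec(coo, x, n_rows):
    """Multiply a COO-encoded matrix by a dense vector.

    Iterating only over stored triples skips every multiplication
    with a zero-valued weight.
    """
    y = np.zeros(n_rows)
    for r, c, v in coo:
        y[r] += v * x[c]
    return y

# A pruned 2x3 weight matrix: 3 of 6 entries are zero, so COO stores 3 triples.
w = np.array([[0.8, 0.0, 0.3],
              [0.0, -0.9, 0.0]])
coo = to_coo(w)
y = coo_matvec(coo, np.array([1.0, 2.0, 3.0]), n_rows=2)
```

The same idea carries over to deconvolution: each stored (index, value) pair drives one useful multiply-accumulate, and the zero-weight work a dense kernel would perform never enters the pipeline.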

Related Work
Pruning and Encoding
Deconvolution
Result
Support for Sparse Weights
Support for the Acceleration of 2D and 3D DCNNs
Experiment Setup
Performance Analysis
Sensitivity to Weight Sparsity
Conclusions

