Abstract
Convolutional neural networks (CNNs) have emerged as the most effective technique for solving a host of machine learning tasks. The compute- and memory-intensive nature of CNNs has stimulated a great deal of work on hardware acceleration of these network models. FPGAs have emerged as a promising platform for accelerating CNNs, owing to their high performance, flexibility, and energy efficiency. We propose a unified architecture named UniWiG, in which both Winograd-based convolution and general matrix multiplication (GEMM) can be accelerated using the same set of processing elements. The proposed architecture has been used to accelerate the AlexNet and VGG-16 models on FPGA, achieving performance of 433.63 GOPS and 407.23 GOPS respectively. We have also analyzed performance across varying Winograd tile sizes and identified the most appropriate tile sizes for maximizing performance while reducing on-chip memory usage.
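To make the Winograd/GEMM connection concrete, the following is a minimal sketch (not the paper's architecture) of the standard 1D Winograd F(2,3) algorithm, which computes two outputs of a 3-tap convolution with four multiplications instead of six; the element-wise product at its core is what maps naturally onto GEMM-style processing elements. The function name `winograd_f23` is illustrative; the transform matrices are the standard F(2,3) constants.

```python
import numpy as np

# Standard F(2,3) transform matrices.
BT = np.array([[1,  0, -1,  0],
               [0,  1,  1,  0],
               [0, -1,  1,  0],
               [0,  1,  0, -1]], dtype=np.float32)
G  = np.array([[1.0,  0.0, 0.0],
               [0.5,  0.5, 0.5],
               [0.5, -0.5, 0.5],
               [0.0,  0.0, 1.0]], dtype=np.float32)
AT = np.array([[1, 1,  1,  0],
               [0, 1, -1, -1]], dtype=np.float32)

def winograd_f23(d, g):
    """One F(2,3) tile: d is a 4-element input tile, g a 3-tap filter."""
    U = G @ g    # filter transform
    V = BT @ d   # input transform
    M = U * V    # 4 element-wise multiplications -- the GEMM-friendly core
    return AT @ M  # output transform -> 2 convolution outputs

d = np.array([1.0, 2.0, 3.0, 4.0], dtype=np.float32)
g = np.array([0.5, 1.0, -1.0], dtype=np.float32)
print(winograd_f23(d, g))                 # Winograd result
print(np.convolve(d, g[::-1], 'valid'))   # direct correlation, for comparison
```

Larger tiles (e.g. F(4,3), F(6,3)) cut more multiplications per output but require larger transform matrices and more on-chip buffering, which is the trade-off the abstract's tile-size analysis refers to.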