A Fifo Based Accelerator for Convolutional Neural Networks

Vineet Panchbhaiyye,Tokunbo Ogunfunmi

doi:10.1109/icassp40776.2020.9053228

Abstract

In recent years, Deep Neural Networks (DNNs) have achieved state-of-the-art results in various fields like Computer Vision, Natural Language Processing and Speech Recognition. Of all the DNN architectures, Convolutional Neural Networks (CNNs) have been most effective in tasks like image classification and object detection. The high performance of the CNNs comes at the cost of computational complexity. Currently Graphics Processing Units (GPUs) are used to accelerate CNN training and inference on workstations and data servers. Though popular, GPUs are not suitable for embedded applications because they are not energy efficient. ASIC and FPGA accelerators have the potential to run CNNs that are optimized for energy and performance.In this paper we present an architecture which takes a novel approach to compute convolution results using row-wise inputs as opposed to traditional tile-based processing. We are able to exceed the results of state of the art architectures when implemented on an inexpensive PYNQ Z1 board running at 100Mhz. The total latency to run the convolution layers in the VGG16 benchmark is nearly 1.5x lower for our architecture than state of the art architectures.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Fifo Based Accelerator for Convolutional Neural Networks

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

CPU-Accelerator Co-Scheduling for CNN Acceleration at the Edge
Yeongmin Kim ... Arslan Munir
IEEE Access | VOL. 8
Yeongmin Kim, et. al.Yeongmin Kim ... Arslan Munir
01 Jan 2020
IEEE Access | VOL. 8

A 7.663-TOPS 8.2-W Energy-efficient FPGA Accelerator for Binary Convolutional Neural Networks (Abstract Only)
Yixing Li ... Kai Xu
-
Yixing Li, et. al.Yixing Li ... Kai Xu
22 Feb 2017
A 7.663-TOPS 8.2-W Energy-efficient FPGA Accelerator for Binary Convolutional Neural Networks (Abstract Only)
Yixing Li ... Kai Xu

An Uninterrupted Processing Technique-Based High-Throughput and Energy-Efficient Hardware Accelerator for Convolutional Neural Networks
Md Najrul Islam ... Rahul Shrestha
IEEE Transactions on Very Large Scale Integration (VLSI) Systems | VOL. 30
Md Najrul Islam, et. al.Md Najrul Islam ... Rahul Shrestha
01 Dec 2022
IEEE Transactions on Very Large Scale Integration (VLSI) Systems | VOL. 30

Improving the Performance of CNN Accelerator Architecture under the Impact of Process Variations
Jingweijia Tan ... Weiren Wang
ACM Transactions on Design Automation of Electronic Systems | VOL. 28
Jingweijia Tan, et. al.Jingweijia Tan ... Weiren Wang
09 Sep 2023
ACM Transactions on Design Automation of Electronic Systems | VOL. 28

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Fifo Based Accelerator for Convolutional Neural Networks

Abstract

Talk to us

Similar Papers