Laius: an energy-efficient FPGA CNN accelerator with the support of a fixed-point training framework

Qiang Dou,Rangyu Deng,Lei Wang,Zhisheng Li,Shasha Guo,Yu Deng,Zikai Nie

doi:10.1504/ijcse.2020.10027619

Abstract

With the development of convolutional neural networks (CNNs), their high computational complexity and energy consumption become significant problems. Many CNN inference accelerators are proposed to reduce the consumption. Most of them are based on 32-bit float-point matrix multiplication, where the data precision is over-provisioned. This paper presents Laius, an 8-bit fixed-point LeNet inference engine implemented on FPGA. To achieve low-precision computation and storage, we introduce our fixed-point training framework called FixCaffe. To economise FPGA resources, we proposed a methodology to find the optimal bit-length for weight and bias in LeNet. We use optimisations of pipelining, tiling, and theoretical analysis to improve the performance. Experiment results show that Laius achieves 44.9 Gops throughputs. Moreover, with only 1% accuracy loss, 8-bit Laius largely reduces 31.43% in delay, 87.01% in LUT consumption, 66.50% in BRAM consumption, 65.11% in DSP consumption and 47.95% in power compared to the 32-bit version with the same structure.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Laius: an energy-efficient FPGA CNN accelerator with the support of a fixed-point training framework

Abstract

Talk to us

Similar Papers

More From: International Journal of Computational Science and Engineering

Lead the way for us

Similar Papers

Fast and Efficient Convolutional Accelerator for Edge Computing
Arash Ardakani ... Carlo Condo
IEEE Transactions on Computers | VOL. 69
Arash Ardakani, et. al.Arash Ardakani ... Carlo Condo
01 Jan 2020
IEEE Transactions on Computers | VOL. 69

RANA: Towards Efficient Neural Acceleration with Refresh-Optimized Embedded DRAM
Fengbin Tu ... Shaojun Wei
-
Fengbin Tu, et. al.Fengbin Tu ... Shaojun Wei
01 Jun 2018
01 Jun 2018

A Survey on Convolutional Neural Network Accelerators: GPU, FPGA and ASIC
Yunxiang Hu ... Yuhao Liu
-
Yunxiang Hu, et. al.Yunxiang Hu ... Yuhao Liu
07 Jan 2022
07 Jan 2022

DeltaFrame-BP: An Algorithm Using Frame Difference for Deep Convolutional Neural Networks Training and Inference on Video Data
Bing Han ... Kaushik Roy
IEEE Transactions on Multi-Scale Computing Systems | VOL. 4
Bing Han, et. al.Bing Han ... Kaushik Roy
01 Oct 2018
IEEE Transactions on Multi-Scale Computing Systems | VOL. 4

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Laius: an energy-efficient FPGA CNN accelerator with the support of a fixed-point training framework

Abstract

Talk to us

Similar Papers

More From: International Journal of Computational Science and Engineering