An Efficient CNN Accelerator for Low-Cost Edge Systems

Kyubaik Choi,Gerald E Sobelman

doi:10.1145/3539224

Abstract

Customized hardware based convolutional neural network ( CNN or ConvNet ) accelerators have attracted significant attention for applications in a low-cost, edge computing system. However, there is a lack of research that seeks to optimize at both the algorithm and hardware levels simultaneously in resource-constrained FPGA systems. In this paper, we first analyze ConvNet models to find one that is most suitable for a low-cost FPGA implementation. Based on the analysis, we select MobileNetV2 as the backbone of our research due to its hardware-friendly structure. We use a quantized implementation with 4-bit precision and optimize further with a smaller input resolution of 192 × 192 to obtain a 68.8% detection accuracy on ImageNet, which represents only a 3.2% accuracy loss compared to a floating-point model that uses the full input size. We then develop a hardware implementation that uses a low-cost FPGA. To accelerate the depth-wise separable ConvNet and utilize DRAM resources efficiently with parallel processing, we propose a novel scoreboard architecture to dynamically schedule DRAM data requests in order to maintain a high hardware utilization. The number of DSP blocks used is about six times smaller than in prior work. In addition, internal block RAM utilization is approximately nine times more efficient than in prior work. Our proposed design achieves 3.07 frames per second (FPS) on the low-cost and resource constrained FPGA system.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An Efficient CNN Accelerator for Low-Cost Edge Systems

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Embedded Computing Systems

Lead the way for us

Journal: ACM Transactions on Embedded Computing Systems	Publication Date: Jul 31, 2022
Citations: 6

Similar Papers

Retraining-Based Timing Error Mitigation for Hardware Neural Networks
Jiachao Deng ... Chengyong Wu
-
Jiachao Deng, et. al.Jiachao Deng ... Chengyong Wu
01 Jan 2015
01 Jan 2015

Retraining-based timing error mitigation for hardware neural networks
...
-
, et. al. ...
09 Mar 2015
09 Mar 2015

Accelerating applications using edge tensor processing units
Kuan-Chieh Hsu ... Hung-Wei Tseng
-
Kuan-Chieh Hsu, et. al.Kuan-Chieh Hsu ... Hung-Wei Tseng
13 Nov 2021
13 Nov 2021

Photonic Computing and Communication for Neural Network Accelerators
Chengpeng Xia ... Hao Zhang
-
Chengpeng Xia, et. al.Chengpeng Xia ... Hao Zhang
01 Jan 2021
01 Jan 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Efficient CNN Accelerator for Low-Cost Edge Systems

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Embedded Computing Systems