Abstract

Convolutional Neural Networks (CNNs) have achieved remarkable results in various object detection tasks. To extend the applications of CNN detection models, the implementation of model inference on edge platforms, such as ASICs, FPGAs, and other embedded systems, has been intensively investigated in recent years. However, the large model size and its heavy computational overhead constrain the deployment of detection models on edge platforms, which typically have limited computational capability. Quantized inference of CNN models is one of the most efficient approaches to running models on such platforms. In this paper, we develop a hardware-friendly quantized inference scheme for detection models, intended for efficient inference on embedded FPGA systems. The proposed method comprises several techniques that optimize the quantized inference of detection models on FPGA devices. The experimental results demonstrate that the proposed scheme not only makes quantized inference of the detection model more efficient but also maintains object detection accuracy.
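The abstract does not specify the quantization scheme used in the paper. As a point of reference only, the following is a minimal sketch of one common hardware-friendly approach, symmetric per-tensor int8 quantization, where each real value is mapped to an 8-bit integer via a single scale factor; the function names and the choice of symmetric scaling are illustrative assumptions, not the paper's method.

```python
def quantize_int8(values, scale=None):
    """Symmetric per-tensor int8 quantization: q = round(x / scale),
    clamped to the signed 8-bit range [-128, 127].
    (Illustrative helper; not from the paper.)"""
    if scale is None:
        # Choose the scale so the largest magnitude maps to 127.
        scale = max(abs(v) for v in values) / 127.0
    return [max(-128, min(127, round(v / scale))) for v in values], scale

def dequantize_int8(codes, scale):
    """Recover approximate real values from the int8 codes."""
    return [q * scale for q in codes]

weights = [0.5, -1.27, 0.0, 1.27]
codes, scale = quantize_int8(weights)       # codes: [50, -127, 0, 127]
recovered = dequantize_int8(codes, scale)   # close to the original weights
```

On FPGA targets this style of quantization is attractive because the inner loops of convolution then run entirely in narrow integer arithmetic, with a single floating-point (or fixed-point) rescale per output tensor.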
