Abstract

This paper describes a hardware-oriented two-stage algorithm that can be deployed in a resource-limited field-programmable gate array (FPGA) for fast object detection and recognition without external memory. The first stage proposes bounding boxes with a conventional object detection method, and the second performs convolutional neural network (CNN)-based classification to improve accuracy. Frequent accesses to external memory significantly degrade the execution efficiency of object classification, yet existing CNN models with large numbers of parameters are difficult to deploy in FPGAs with limited on-chip memory resources. In this study, we designed a compact CNN model and performed hardware-oriented quantization of both parameters and intermediate results. As a result, CNN-based ultra-fast object classification was realized with all parameters and intermediate results stored on chip. Several evaluations were performed to demonstrate the performance of the proposed algorithm. The object classification module consumes only 163.67 Kbits of on-chip memory for ten regions of interest (ROIs), which makes it suitable for low-end FPGA devices. In terms of accuracy, our method achieves 98.01% on the open-source MNIST data set and over 96.5% on three other self-built data sets, distinctly better than conventional ultra-high-speed object detection algorithms.
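As a rough illustration of the kind of hardware-oriented quantization mentioned above, parameters can be mapped to narrow fixed-point values so that all weights and intermediate results fit in on-chip block RAM. The sketch below assumes signed 8-bit values with a power-of-two scale, which maps well to FPGA shift operations; the paper's actual bit widths and scaling rule may differ.

```c
#include <math.h>
#include <stdint.h>

/* Hypothetical sketch: quantize a floating-point CNN weight to signed 8-bit
 * fixed-point with a power-of-two scale (2^frac_bits). The bit width and
 * scaling rule are assumptions, not necessarily the paper's scheme. */
static int8_t quantize_weight(float w, int frac_bits)
{
    /* Scale by 2^frac_bits and round to the nearest integer. */
    float scaled = w * (float)(1 << frac_bits);
    long  q      = lroundf(scaled);

    /* Saturate to the signed 8-bit range instead of wrapping. */
    if (q >  127) q =  127;
    if (q < -128) q = -128;
    return (int8_t)q;
}
```

With such a representation, each multiply-accumulate reduces to integer arithmetic, and the scale is recovered by a simple right shift, which is one reason power-of-two scales are attractive in designs that keep everything on chip.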
