Anchor Generation Optimization and Region of Interest Assignment for Vehicle Detection.

Ye Wang,Weiwen Deng,Zhenyi Liu

doi:10.3390/s19051089

Abstract

Region proposal network (RPN) based object detection, such as Faster Regions with CNN (Faster R-CNN), has gained considerable attention due to its high accuracy and fast speed. However, it has room for improvements when used in special application situations, such as the on-board vehicle detection. Original RPN locates multiscale anchors uniformly on each pixel of the last feature map and classifies whether an anchor is part of the foreground or background with one pixel in the last feature map. The receptive field of each pixel in the last feature map is fixed in the original faster R-CNN and does not coincide with the anchor size. Hence, only a certain part can be seen for large vehicles and too much useless information is contained in the feature for small vehicles. This reduces detection accuracy. Furthermore, the perspective projection results in the vehicle bounding box size becoming related to the bounding box position, thereby reducing the effectiveness and accuracy of the uniform anchor generation method. This reduces both detection accuracy and computing speed. After the region proposal stage, many regions of interest (ROI) are generated. The ROI pooling layer projects an ROI to the last feature map and forms a new feature map with a fixed size for final classification and box regression. The number of feature map pixels in the projected region can also influence the detection performance but this is not accurately controlled in former works. In this paper, the original faster R-CNN is optimized, especially for the on-board vehicle detection. This paper tries to solve these above-mentioned problems. The proposed method is tested on the KITTI dataset and the result shows a significant improvement without too many tricky parameter adjustments and training skills. The proposed method can also be used on other objects with obvious foreshortening effects, such as on-board pedestrian detection. The basic idea of the proposed method does not rely on concrete implementation and thus, most deep learning based object detectors with multiscale feature maps can be optimized with it.

Highlights

Vision-based advanced driver assistance system (V-ADAS) has drawn great attention from both researchers and manufacturers in recent years due to the advantages of its camera compared with other sensors
The proposed method is tested on the KITTI dataset and the result shows a significant improvement without too many tricky parameter adjustments and training skills
The proposed method is implemented with PyTorch 0.4, an open source deep learning framework developed by Facebook AI Research and accelerated with CUDA 8.0 and cuDNN 5.0

Summary

Introduction

Vision-based advanced driver assistance system (V-ADAS) has drawn great attention from both researchers and manufacturers in recent years due to the advantages (such as affordability, large information capacity and environmentally friendly) of its camera compared with other sensors. Vehicle detection methods generated candidate bounding boxes roughly through knowledge-based information, such as shadows [1,2], symmetry [3,4] and vertical/horizontal edges [5,6]. They classified these candidate bounding boxes through predefined feature extractors, such as Harr, Histogram of Oriented. Cityscapes [11], and the progress of the GPU computing annotated image datasets, such as Pascal [9], KITTI [10] and Cityscapes [11], and the progress of the speed,computing data-driven convolutional neural networks (CNN).

Methods

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Sensors	Publication Date: Mar 3, 2019
Citations: 24	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Anchor Generation Optimization and Region of Interest Assignment for Vehicle Detection.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Sensors

Lead the way for us

Similar Papers

Text Detection by Faster R-CNN with Multiple Region Proposal Networks
Yoshito Nagaoka ... Yoshihiro Sugaya
-
Yoshito Nagaoka, et. al.Yoshito Nagaoka ... Yoshihiro Sugaya
01 Nov 2017
01 Nov 2017

An algorithm for automatic identification of multiple developmental stages of rice spikes based on improved Faster R-CNN
Yuanqin Zhang ... Huilin Wu
The Crop Journal | VOL. 10
Yuanqin Zhang, et. al.Yuanqin Zhang ... Huilin Wu
19 Jul 2022
The Crop Journal | VOL. 10

A comprehensive swarming intelligent method for optimizing deep learning-based object detection by unmanned ground vehicles.
Qian Xu ... Ling Shi
PLOS ONE | VOL. 16
Qian Xu, et. al.Qian Xu ... Ling Shi
13 May 2021
PLOS ONE | VOL. 16

Textile Fabric Defect Detection Based on Improved Faster R-CNN
Dongfang He ... Zhihui Lai
AATCC Journal of Research | VOL. 8
Dongfang He, et. al.Dongfang He ... Zhihui Lai
01 Sep 2021
AATCC Journal of Research | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Anchor Generation Optimization and Region of Interest Assignment for Vehicle Detection.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Sensors