Abstract

General object-detection methods based on deep learning have received considerable attention in the field of computer vision. However, when they are applied to vehicle detection (VD) in a straightforward manner to realize an intelligent vehicle (IV), a graphics processing unit (GPU) is required for their real-time implementation. The use of GPUs is unacceptable in commercial VD systems. A novel on-road VD method comprising the use of a multi-stage convolutional neural network (MSCNN) is proposed to solve this problem. In the MSCNN, the properties of the vehicles are exploited, and an efficient region proposal specialized for vehicles is developed. The proposed MSCNN comprises four stages: vehicle lower-boundary detection, vehicle upper-boundary detection, region proposal network (RPN), and vehicle classification. Effective anchor boxes are generated in the lower- and upper-boundary-detection stages with appropriate sizes for vehicles of all scales. The bounding box of the vehicle within the anchor box is determined in the RPN stage. In the last stage, the predicted bounding boxes are classified as vehicles or non-vehicles. Finally, the proposed method is applied to the KITTI, CrowdAI, and AUTTI datasets, and its advantages are demonstrated by comparing its performance with those presented in previous studies. The proposed MSCNN realizes an average precision (72.1%) on the KITTI dataset while running on a central processing unit (CPU).

Highlights

  • Autonomous driving is considered as one of the most attractive emerging technologies in the industry and academia

  • This study focuses on vehicle detection (VD) alone with the use of a monocular camera [4]- [7]

  • EXPERIMENTAL RESULTS the proposed multi-stage convolutional neural network (MSCNN) is applied to three datasets: KITTI VD [30], AUTTI [31], and CrowdAI [31]

Read more

Summary

INTRODUCTION

Autonomous driving is considered as one of the most attractive emerging technologies in the industry and academia. To solve the above problems, a new VD method called multi-stage CNN (MSCNN) based on deep learning is proposed in this study. 3) The MSCNN demonstrates a higher detection performance for small vehicles than a general OD, such as YOLOv3 and YOLOv3-tiny. This is because, as the MSCNN is implemented using shallow networks, a considerably smaller number of pooling layers is used in the MSCNN than that in the general OD, preventing the loss of information.

SYSTEM OVERVIEW
TRAINING PROCESS
EXPERIMENTAL RESULTS
KITTI DATASET
CROWDAI DATASET
AUTTI DATASET
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.