On the Performance of One-Stage and Two-Stage Object Detectors in Autonomous Vehicles Using Camera Data

Manuel Carranza-García,Jorge García-Gutiérrez,Jesús Torres-Mateo,Pedro Lara-Benítez

doi:10.3390/rs13010089

Abstract

Object detection using remote sensing data is a key task of the perception systems of self-driving vehicles. While many generic deep learning architectures have been proposed for this problem, there is little guidance on their suitability when using them in a particular scenario such as autonomous driving. In this work, we aim to assess the performance of existing 2D detection systems on a multi-class problem (vehicles, pedestrians, and cyclists) with images obtained from the on-board camera sensors of a car. We evaluate several one-stage (RetinaNet, FCOS, and YOLOv3) and two-stage (Faster R-CNN) deep learning meta-architectures under different image resolutions and feature extractors (ResNet, ResNeXt, Res2Net, DarkNet, and MobileNet). These models are trained using transfer learning and compared in terms of both precision and efficiency, with special attention to the real-time requirements of this context. For the experimental study, we use the Waymo Open Dataset, which is the largest existing benchmark. Despite the rising popularity of one-stage detectors, our findings show that two-stage detectors still provide the most robust performance. Faster R-CNN models outperform one-stage detectors in accuracy, being also more reliable in the detection of minority classes. Faster R-CNN Res2Net-101 achieves the best speed/accuracy tradeoff but needs lower resolution images to reach real-time speed. Furthermore, the anchor-free FCOS detector is a slightly faster alternative to RetinaNet, with similar precision and lower memory usage.

Highlights

The increase in availability and quality of remote sensing data provided by modern multi-modal sensors has allowed pushing the state-of-the-art in many computer vision tasks
We study the combination of onestage (RetinaNet, Fully Convolutional One-Stage Object Detector (FCOS), YOLOv3) and two-stage (Faster R-convolutional neural networks (CNNs)) meta-architectures with different feature extractors (ResNet-50, Residual Networks (ResNet)-101, ResNet-152, ResNeXt-101, Res2Net-101, DarkNet-53, MobileNet V1, MobileNet V2)
We present an experimental study comparing the performance of several deep learning-based object detection systems in the context of autonomous vehicles

Summary

Introduction

The increase in availability and quality of remote sensing data provided by modern multi-modal sensors has allowed pushing the state-of-the-art in many computer vision tasks. The data provided by high-resolution cameras and proximity sensors have helped to develop more powerful machine learning models that have achieved unprecedented results in visual recognition problems [1] These developments have significantly improved the perception systems used in many applications such as autonomous driving [2,3], security surveillance [4], or land monitoring [5]. One of the essential tasks that an ADAS needs to address is object detection These remote sensing systems need to detect traffic targets in real time in order to make informed driving decisions. They have to be robust enough to operate effectively in complex scenarios such as adverse weather, poor lighting, or occluded objects.

Objectives

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Remote Sensing	Publication Date: Dec 29, 2020
Citations: 103	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

On the Performance of One-Stage and Two-Stage Object Detectors in Autonomous Vehicles Using Camera Data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Remote Sensing

Lead the way for us

Similar Papers

Object detection using depth completion and camera-LiDAR fusion for autonomous driving
Manuel Carranza-García ... F Javier Galán-Sales
Integrated Computer-Aided Engineering | VOL. 29
Manuel Carranza-García, et. al.Manuel Carranza-García ... F Javier Galán-Sales
21 Jun 2022
Integrated Computer-Aided Engineering | VOL. 29

MimicDet: Bridging the Gap Between One-Stage and Two-Stage Object Detection
Xin Lu ... Quanquan Li
-
Xin Lu, et. al.Xin Lu ... Quanquan Li
01 Jan 2020
01 Jan 2020

Enhancing object detection for autonomous driving by optimizing anchor generation and addressing class imbalance
Manuel Carranza-García ... José C Riquelme
Neurocomputing | VOL. 449
Manuel Carranza-García, et. al.Manuel Carranza-García ... José C Riquelme
06 Apr 2021
Neurocomputing | VOL. 449

Deep Learning-Based Object Detection: An Investigation
Kanojia Sindhuben Babulal ... Amit Kumar Das
-
Kanojia Sindhuben Babulal, et. al.Kanojia Sindhuben Babulal ... Amit Kumar Das
01 Jan 2021
01 Jan 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

On the Performance of One-Stage and Two-Stage Object Detectors in Autonomous Vehicles Using Camera Data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Remote Sensing