MonoAux: Fully Exploiting Auxiliary Information and Uncertainty for Monocular 3D Object Detection.

Zhenglin Li, Wenbo Zheng, Yang Zhou, Liyan Ma, Yan Peng, Le Yang

doi:10.34133/cbsystems.0097

Open Access

https://doi.org/10.34133/cbsystems.0097

Abstract

Monocular 3D object detection plays a pivotal role in autonomous driving, presenting a formidable challenge by requiring the precise localization of 3D objects within a single image, devoid of depth information. Most existing methods in this domain fall short of harnessing the limited information available in monocular 3D detection tasks. They typically provide only a single detection outcome, omitting essential uncertainty analysis and result post-processing during model inference, thus limiting overall model performance. In this paper, we propose a comprehensive framework that maximizes information extraction from monocular images while encompassing diverse depth estimation and incorporating uncertainty analysis. Specifically, we mine additional information intrinsic to the monocular 3D detection task to augment supervision, thereby addressing the information scarcity challenge. Moreover, our framework handles depth estimation by recovering multiple sets of depth values from calculated visual heights. The final depth estimate and 3D confidence are determined through an uncertainty fusion process, effectively reducing inference errors. Furthermore, to address task weight allocation in multi-task training, we present a versatile training strategy tailored to monocular 3D detection. This approach leverages measurement indicators to monitor task progress, adaptively adjusting loss weights for different tasks. Experimental results on the KITTI and Waymo datasets confirm the effectiveness of our approach. The proposed method consistently provides enhanced performance across various difficulty levels compared to the original framework while maintaining real-time efficiency.
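
The depth recovery and uncertainty fusion described in the abstract can be illustrated with a minimal sketch. It assumes the standard pinhole relation (depth = focal length × physical height / visual height) and uses inverse-variance weighting as a stand-in fusion rule; the abstract does not specify the exact fusion used in MonoAux. The function names and all numeric values below are hypothetical.

```python
import math


def depth_from_height(focal_px, height_3d_m, height_2d_px):
    """Pinhole relation: depth = f * H / h (f, h in pixels; H in metres)."""
    return focal_px * height_3d_m / height_2d_px


def fuse_depths(depths, sigmas):
    """Inverse-variance weighted mean (an assumed fusion rule, not the paper's).

    Returns the fused depth and the standard deviation of the fused estimate.
    """
    weights = [1.0 / (s * s) for s in sigmas]
    total = sum(weights)
    fused = sum(w * d for w, d in zip(weights, depths)) / total
    return fused, math.sqrt(1.0 / total)


# Hypothetical inputs: one object, three visual-height measurements.
focal_px = 720.0                       # assumed focal length in pixels
height_3d_m = 1.5                      # assumed physical object height
visual_heights_px = [36.0, 40.0, 45.0]  # assumed projected heights
sigmas = [2.0, 1.0, 2.0]               # assumed per-estimate depth uncertainty (m)

depths = [depth_from_height(focal_px, height_3d_m, h) for h in visual_heights_px]
fused_depth, fused_sigma = fuse_depths(depths, sigmas)
print(depths, fused_depth, fused_sigma)
```

Under this weighting, estimates with lower predicted uncertainty dominate the fused depth, and the fused uncertainty is smaller than that of any single estimate; it could then be mapped to a 3D confidence score.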

Journal: Cyborg and bionic systems (Washington, D.C.)
Publication Date: Jan 1, 2024
Citations: 1
License type: CC BY

Similar Papers

Monocular depth estimation for vision-based vehicles based on a self-supervised learning method
Marco Tektonidis ... David Monnin
23 Apr 2020

Unsupervised depth estimation for ship target based on single view UAV image
Tao Liu ... Yuchi Huo
International Journal of Remote Sensing | VOL. 43
03 May 2022

SENSE: Self-Evolving Learning for Self-Supervised Monocular Depth Estimation.
Guanbin Li ... Haofeng Li
IEEE transactions on image processing : a publication of the IEEE Signal Processing Society | VOL. 33
01 Jan 2024

Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving
Yi-Nan Chen ... Yong Ding
01 Jun 2022
