PDR: Progressive Depth Regularization for Monocular 3D Object Detection

Hualian Sheng, Gim Hee Lee, Bing Deng, Na Zhao, Min-Jian Zhao, Sijia Cai

doi:10.1109/tcsvt.2023.3276518

Abstract

Accurately predicting object depth is a key challenge in the monocular 3D detection task. The perspective projection principle used by most state-of-the-art approaches demands a complex balance between the ratio-form depth estimation and 2D-3D geometric regularizations, and thus can lead to sub-optimal solutions. In this paper, we propose a novel synergistic scheme that can achieve a better trade-off among these competing objectives. Our main proposal is a progressive depth regularization (PDR) architecture that splits the overall training process into three sequential depth estimation steps to gradually remove the unwanted deviations induced by over-regularization. Specifically, our model first learns the coarse depth with the conventional perspective projection and combines it with coarse-to-fine generation to reduce the search space of 2D projection height prediction. We then deactivate individual supervision on 2D projection height prediction and introduce a new auxiliary 3D physical height prediction to relax the 2D and 3D regularizations, respectively. Consequently, our PDR leads to more precise depth estimation by mitigating the inherent ambiguities in the geometric priors of perspective projection through progressive regularization relaxation. Extensive experiments on both the KITTI and Rope3D benchmarks show that our PDR delivers strong performance gains compared to previous methods.
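The "ratio-form depth estimation" mentioned in the abstract is commonly understood as the pinhole-camera relation depth = focal length × 3D physical height / 2D projected height. The sketch below illustrates only that general relation and its sensitivity to errors in the projected height; it is not the authors' implementation, and the function name and all numeric values are hypothetical, chosen purely for illustration.

```python
def depth_from_heights(focal_px: float, height_3d_m: float, height_2d_px: float) -> float:
    """Pinhole-camera ratio: depth = focal * physical height / projected height."""
    if height_2d_px <= 0:
        raise ValueError("projected height must be positive")
    return focal_px * height_3d_m / height_2d_px


# Illustrative values only (assumptions, not taken from the paper).
focal_px = 720.0      # hypothetical focal length in pixels
height_3d_m = 1.5     # hypothetical physical object height in metres
height_2d_px = 36.0   # hypothetical projected object height in pixels

depth = depth_from_heights(focal_px, height_3d_m, height_2d_px)

# Under-estimating the projected height by 2 pixels pushes the depth estimate
# further away, because the height sits in the denominator of the ratio.
depth_perturbed = depth_from_heights(focal_px, height_3d_m, height_2d_px - 2.0)

print(depth, depth_perturbed)
```

Because depth is a ratio of two predicted heights, an error in either one propagates directly into the depth estimate, which may help explain why the abstract describes the 2D and 3D height regularizations as needing careful balance.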

Journal: IEEE Transactions on Circuits and Systems for Video Technology: a publication of the Circuits and Systems Society
Publication Date: Dec 1, 2023
Citations: 3

Similar Papers

Adversarial Learning for Joint Optimization of Depth and Ego-Motion
Anjie Wang ... Yongbin Gao
IEEE Transactions on Image Processing | VOL. 29
01 Jan 2020

Spatial-frequency analysis in the perception of perspective depth
Ko Sakai ... Leif H Finkel
Network (Bristol, England) | VOL. 8
01 Jan 1997

Spatial-frequency analysis in the perception of perspective depth
Ko Sakai ... Leif Finkel
Network (Bristol, England) | VOL. 8
01 Aug 1997

SPLODE: Semi-probabilistic point and line odometry with depth estimation from RGB-D camera motion
Pedro F Proenca ... Yang Gao
01 Sep 2017
