Visibility of points: Mining occlusion cues for monocular 3D object detection

Huazhen Chu, Lisha Mo, Rongquan Wang, Tianyu Hu, Huimin Ma

doi:10.1016/j.neucom.2022.06.099

Abstract

Monocular 3D object detection aims to predict objects in the three-dimensional physical world from a two-dimensional image plane. In practice, occlusion inevitably limits performance. To address the difficulty of directly representing the spatial information of occlusion relations, we propose visibility states of points, which describe the spatial distance relationships within occlusion pairs and the orientation information they imply. Introducing visibility states better represents the level and direction of occlusion and enhances the network’s understanding of it. Furthermore, we redesign an end-to-end detector to encode features of the visibility states, integrating occlusion ordering cues across the whole image to assist object localization in world space. Experiments on the KITTI3D dataset indicate that our method succeeds in establishing visibility states as occlusion cues and improves the performance of the original detector. Its performance is comparable with state-of-the-art approaches and is particularly strong in the Moderate and Hard cases: it raises 3D detection accuracy on KITTI3D to 42.75% in the Moderate case and 37.03% in the Hard case.

Journal: Neurocomputing
Publication Date: Jun 30, 2022
Citations: 4