Abstract
In this paper, we propose a weakly-supervised approach for 3D object detection, which makes it possible to train a strong 3D detector with position-level annotations (i.e., annotations of object centers and categories). To remedy the information loss from box annotations to centers, our method makes use of synthetic 3D shapes to convert the position-level annotations into virtual scenes with box-level annotations, and in turn utilizes the fully annotated virtual scenes to complement the real labels. Specifically, we first present a shape-guided label-enhancement method, which assembles 3D shapes into physically plausible virtual scenes according to the coarse scene layout extracted from position-level annotations. We then transfer the information contained in the virtual scenes back to the real ones by applying a virtual-to-real domain adaptation method, which refines the annotated object centers and additionally supervises the training of the detector with the virtual scenes. Since the shape-guided label-enhancement method generates virtual scenes using hand-crafted physical constraints, the layouts of the resulting fixed virtual scenes may be implausible for varied object combinations. To address this, we further present a differentiable label-enhancement method that optimizes the virtual scenes, including object scales, orientations, and locations, in a data-driven manner. Moreover, we propose a label-assisted self-training strategy to fully exploit the capability of the detector. By reusing the position-level annotations and virtual scenes, we fuse the information from both domains and generate box-level pseudo labels on the real scenes, which enables us to directly train a detector in a fully supervised manner. Extensive experiments on the widely used ScanNet and Matterport3D datasets show that our approach surpasses current weakly-supervised and semi-supervised methods by a large margin, and achieves detection performance comparable to some popular fully-supervised methods with less than 5% of the labeling labor.
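To make the shape-guided label-enhancement step more concrete, below is a minimal, illustrative Python sketch (not the authors' implementation) of converting position-level annotations into box-level virtual labels: a synthetic shape is retrieved for each annotated center and placed there, with a simple non-overlap check standing in for the physical-plausibility constraints. All names (`Annotation`, `Shape`, `place_virtual_scene`) and the single-shape-per-category lookup are hypothetical assumptions for illustration only.

```python
# Illustrative sketch only: assemble a "virtual scene" by placing axis-aligned
# boxes of retrieved synthetic shapes at annotated object centers, rejecting
# placements that would intersect already-placed objects.
from dataclasses import dataclass
from typing import List, Optional
import numpy as np


@dataclass
class Annotation:           # position-level label: object center + category
    center: np.ndarray      # (3,) center in scene coordinates
    category: str


@dataclass
class Shape:                # synthetic 3D shape retrieved from a CAD library
    size: np.ndarray        # (3,) axis-aligned box extents (w, l, h)
    category: str


def boxes_overlap(c1: np.ndarray, s1: np.ndarray,
                  c2: np.ndarray, s2: np.ndarray) -> bool:
    """Axis-aligned overlap test between two boxes given (center, size)."""
    return bool(np.all(np.abs(c1 - c2) < (s1 + s2) / 2.0))


def place_virtual_scene(annotations: List[Annotation],
                        shape_bank: dict) -> List[dict]:
    """Turn position-level annotations into box-level virtual labels."""
    placed: List[dict] = []
    for ann in annotations:
        shape: Optional[Shape] = shape_bank.get(ann.category)
        if shape is None:
            continue
        # Toy physical-plausibility constraint: skip a shape that would
        # intersect an object that has already been placed in the scene.
        if any(boxes_overlap(ann.center, shape.size, p["center"], p["size"])
               for p in placed):
            continue
        placed.append({"center": ann.center,
                       "size": shape.size,
                       "category": ann.category})
    return placed
```

In this toy version the virtual boxes are fixed once placed; the differentiable label enhancement described above would instead treat the object scales, orientations, and locations as learnable quantities and refine them in a data-driven manner.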