ImVoxelNet: Image to Voxels Projection for Monocular and Multi-View General-Purpose 3D Object Detection

Danila Rukhovich, Anton Konushin, Anna Vorontsova

doi:10.1109/wacv51458.2022.00133

Open Access

https://doi.org/10.1109/wacv51458.2022.00133

Abstract

In this paper, we introduce the task of multi-view RGB-based 3D object detection as an end-to-end optimization problem. To address this problem, we propose ImVoxelNet, a novel fully convolutional method of 3D object detection based on posed monocular or multi-view RGB images. The number of monocular images in each multi-view input can vary during training and inference; in fact, this number may differ for each multi-view input. ImVoxelNet successfully handles both indoor and outdoor scenes, which makes it general-purpose. Specifically, it achieves state-of-the-art results in car detection on the KITTI (monocular) and nuScenes (multi-view) benchmarks among all methods that accept RGB images. Moreover, it surpasses existing RGB-based 3D object detection methods on the SUN RGB-D dataset. On ScanNet, ImVoxelNet sets a new benchmark for multi-view 3D object detection. The source code and the trained models are available at https://github.com/saic-vul/imvoxelnet.
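
The abstract names an image-to-voxels projection over a variable number of posed views but does not spell out the mechanics. The following is a minimal sketch of that idea under stated assumptions, not the authors' implementation: each view is assumed to come with a 3x4 projection matrix (intrinsics times extrinsics), features are looked up at the nearest pixel, and views are combined by a plain average over the views that actually see a voxel. The names project_point and aggregate_voxel_features are hypothetical.

```python
from typing import List, Sequence, Tuple

Matrix = Sequence[Sequence[float]]    # assumed 3x4 projection matrix (intrinsics @ extrinsics)
FeatureMap = List[List[List[float]]]  # H x W x C grid of 2D image features


def project_point(P: Matrix, xyz: Sequence[float]) -> Tuple[float, float, float]:
    """Project a 3D world point to pixel coordinates (u, v) and return its depth."""
    x, y, z = xyz
    row = [P[i][0] * x + P[i][1] * y + P[i][2] * z + P[i][3] for i in range(3)]
    depth = row[2]
    if depth <= 0:
        # Point is behind the camera; pixel coordinates are meaningless here.
        return 0.0, 0.0, depth
    return row[0] / depth, row[1] / depth, depth


def aggregate_voxel_features(
    voxel_centers: Sequence[Sequence[float]],
    views: Sequence[Tuple[Matrix, FeatureMap]],
) -> List[List[float]]:
    """For each voxel, average the features of every view in which it is visible."""
    channels = len(views[0][1][0][0])
    volume = []
    for center in voxel_centers:
        total = [0.0] * channels
        hits = 0
        for P, fmap in views:  # works for one view or many
            u, v, depth = project_point(P, center)
            col, row = int(round(u)), int(round(v))  # nearest-pixel lookup (assumption)
            if depth > 0 and 0 <= row < len(fmap) and 0 <= col < len(fmap[0]):
                total = [t + f for t, f in zip(total, fmap[row][col])]
                hits += 1
        # Voxels seen by no view keep an all-zero feature vector.
        volume.append([t / hits for t in total] if hits else total)
    return volume
```

Because the average is taken only over the views that see a voxel, the same function handles a monocular input (a single-element views list) and multi-view inputs of any size, which mirrors the variable-view property described in the abstract.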

Similar Papers

2D-to-3D Projection for Monocular and Multi-View 3D Object Detection in Outdoor Scenes
D D Rukhovich
Programmnaya Ingeneria | VOL. 12
11 Oct 2021

A Comprehensive Review on 3D Object Detection and 6D Pose Estimation With Deep Learning
Sabera Hoque ... Shuxiang Xu
IEEE Access | VOL. 9
01 Jan 2020

3D object detection: Learning 3D bounding boxes from scaled down 2D bounding boxes in RGB-D images
Mohammad Muntasir Rahman ... Ke Lu
Information Sciences | VOL. 476
04 Oct 2018

SL3D - Single Look 3D Object Detection based on RGB-D Images
Gopi Krishna Erabati ... Helder Araujo
29 Nov 2020
