Data-driven 3D Voxel Patterns for object category recognition

Yu Xiang, Silvio Savarese, Wongun Choi, Yuanqing Lin

doi:10.1109/cvpr.2015.7298800

Abstract

Despite the great progress achieved in recognizing objects as 2D bounding boxes in images, it is still very challenging to detect occluded objects and estimate the 3D properties of multiple objects from a single image. In this paper, we propose a novel object representation, 3D Voxel Pattern (3DVP), that jointly encodes the key properties of objects including appearance, 3D shape, viewpoint, occlusion and truncation. We discover 3DVPs in a data-driven way, and train a bank of specialized detectors for a dictionary of 3DVPs. The 3DVP detectors can detect objects with specific visibility patterns and transfer the meta-data from the 3DVPs to the detected objects, such as the 2D segmentation mask, 3D pose, and occlusion or truncation boundaries. The transferred meta-data allows us to infer the occlusion relationships among objects, which in turn improves object recognition results. Experiments are conducted on the KITTI detection benchmark [17] and the outdoor-scene dataset [41]. We improve state-of-the-art results on car detection and pose estimation by notable margins (6% on the difficult data of KITTI). We also verify that our method can accurately segment objects from the background and localize them in 3D.

Similar Papers

3D hand pose and shape estimation from RGB images for keypoint-based hand gesture recognition
Danilo Avola ... Daniele Pannone
Pattern Recognition | VOL. 129
30 Apr 2022

Joint 3D Human Shape Recovery and Pose Estimation from a Single Image with Bilayer Graph
Xin Yu ... Jeroen Van Baar
01 Dec 2021

Three-D Safari: Learning to Estimate Zebra Pose, Shape, and Texture From Images “In the Wild”
Silvia Zuffi ... Michael Black
01 Oct 2019

Simultaneous 3D face pose and person-specific shape estimation from a single image using a holistic approach
Fadi Dornaika ... Bogdan Raducanu
01 Dec 2009
