DiffCAD: Weakly-Supervised Probabilistic CAD Model Retrieval and Alignment from an RGB Image

Daoyi Gao,David Rozenberszki,Angela Dai,Stefan Leutenegger

doi:10.1145/3658236

Abstract

Perceiving 3D structures from RGB images based on CAD model primitives can enable an effective, efficient 3D object-based representation of scenes. However, current approaches rely on supervision from expensive yet imperfect annotations of CAD models associated with real images, and encounter challenges due to the inherent ambiguities in the task - both in depth-scale ambiguity in monocular perception, as well as inexact matches of CAD database models to real observations. We thus propose DiffCAD, the first weakly-supervised probabilistic approach to CAD retrieval and alignment from an RGB image. We learn a probabilistic model through diffusion, modeling likely distributions of shape, pose, and scale of CAD objects in an image. This enables multi-hypothesis generation of different plausible CAD reconstructions, requiring only a few hypotheses to characterize ambiguities in depth/scale and inexact shape matches. Our approach is trained only on synthetic data, leveraging monocular depth and mask estimates to enable robust zero-shot adaptation to various real target domains. Despite being trained solely on synthetic data, our multi-hypothesis approach can even surpass the supervised state-of-the-art on the Scan2CAD dataset by 5.9% with 8 hypotheses.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

DiffCAD: Weakly-Supervised Probabilistic CAD Model Retrieval and Alignment from an RGB Image

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Graphics

Lead the way for us

Similar Papers

Relative Pose Estimation between Image Object and ShapeNet CAD Model for Automatic 4-DoF Annotation
Soon-Yong Park ... Chang-Min Son
Applied Sciences | VOL. 13
Soon-Yong Park, et. al.Soon-Yong Park ... Chang-Min Son
04 Jan 2023
Applied Sciences | VOL. 13

Learning Local RGB-to-CAD Correspondences for Object Pose Estimation
Georgios Georgakis ... Jana Kosecka
-
Georgios Georgakis, et. al.Georgios Georgakis ... Jana Kosecka
01 Oct 2019
01 Oct 2019

Learning to Estimate Pose and Shape of Hand-Held Objects from RGB Images
Mia Kokic ... Danica Kragic
-
Mia Kokic, et. al.Mia Kokic ... Danica Kragic
01 Nov 2019
01 Nov 2019

Pose Estimation from RGB Images of Highly Symmetric Objects using a Novel Multi-Pose Loss and Differential Rendering
Stefan Hein Bengtson ... Elin A Topp
-
Stefan Hein Bengtson, et. al.Stefan Hein Bengtson ... Elin A Topp
27 Sep 2021
27 Sep 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

DiffCAD: Weakly-Supervised Probabilistic CAD Model Retrieval and Alignment from an RGB Image

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Graphics