Abstract

3D object pose estimation is a critical task in many real-world applications, e.g., robotic manipulation and augmented reality. Most existing methods focus on estimating the poses of object instances or categories that have been seen during training. However, in the real world it is imperative to estimate the pose of unseen objects without re-training the network. Therefore, we propose a 3D pose estimation method for unseen objects that requires no re-training. Specifically, given the CAD model of an unseen object, a set of template RGB-D images (RGB images and depth images) is rendered from different viewpoints. A feature embedding network, named PoseFusion, is then designed to extract scene features. In this network, the RGB and depth images are used to extract texture features and geometric features, respectively. A cross-modality alignment module is proposed to eliminate the noise in each single modality, and the aligned texture and geometric features are fused through a geometry-guided fusion module. Thus, with PoseFusion, the template RGB-D images generated from the CAD model are abstracted into a set of template scene features, and query scene features are likewise embedded from the RGB-D images captured of the unseen object. Finally, the query scene features are matched against the template scene features by computing a masked local similarity, and the identity and pose of the unseen object are determined by the most similar template. Experiments on the LINEMOD and T-LESS datasets demonstrate that our method outperforms other methods and generalizes better to unseen objects. Extensive ablation studies verify the effectiveness of PoseFusion.
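
The matching step described above can be illustrated with a minimal sketch (PyTorch) of a masked local similarity between a query feature map and a set of template feature maps, where the best-matching template provides the identity and viewpoint. The tensor shapes, function names, and the use of cosine similarity here are illustrative assumptions for this sketch, not the paper's exact formulation.

# Minimal sketch of masked local similarity matching (assumed shapes and
# cosine similarity; not the authors' exact implementation).
import torch
import torch.nn.functional as F


def masked_local_similarity(query_feat: torch.Tensor,
                            template_feats: torch.Tensor,
                            template_masks: torch.Tensor) -> torch.Tensor:
    """Score each template against the query feature map.

    query_feat:     (C, H, W)    embedded query scene features
    template_feats: (N, C, H, W) embedded template scene features
    template_masks: (N, H, W)    binary object masks for each template
    returns:        (N,)         one similarity score per template
    """
    # Pixel-wise cosine similarity between the query and each template.
    q = F.normalize(query_feat.unsqueeze(0), dim=1)   # (1, C, H, W)
    t = F.normalize(template_feats, dim=1)            # (N, C, H, W)
    local_sim = (q * t).sum(dim=1)                    # (N, H, W)

    # Average the local similarity only over each template's object mask.
    masked = local_sim * template_masks
    return masked.sum(dim=(1, 2)) / template_masks.sum(dim=(1, 2)).clamp(min=1)


# Usage: the most similar template's known rendering viewpoint gives the pose.
if __name__ == "__main__":
    query = torch.randn(64, 32, 32)
    templates = torch.randn(120, 64, 32, 32)          # e.g. 120 rendered viewpoints
    masks = (torch.rand(120, 32, 32) > 0.5).float()
    scores = masked_local_similarity(query, templates, masks)
    best = scores.argmax().item()
    print(f"best template index: {best}, score: {scores[best]:.3f}")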
