CatFormer: Category-Level 6D Object Pose Estimation with Transformer

Sheng Yu,Yuanqing Xia,Di-Hua Zhai

doi:10.1609/aaai.v38i7.28505

Abstract

Although there has been significant progress in category-level object pose estimation in recent years, there is still considerable room for improvement. In this paper, we propose a novel transformer-based category-level 6D pose estimation method called CatFormer to enhance the accuracy pose estimation. CatFormer comprises three main parts: a coarse deformation part, a fine deformation part, and a recurrent refinement part. In the coarse and fine deformation sections, we introduce a transformer-based deformation module that performs point cloud deformation and completion in the feature space. Additionally, after each deformation, we incorporate a transformer-based graph module to adjust fused features and establish geometric and topological relationships between points based on these features. Furthermore, we present an end-to-end recurrent refinement module that enables the prior point cloud to deform multiple times according to real scene features. We evaluate CatFormer's performance by training and testing it on CAMERA25 and REAL275 datasets. Experimental results demonstrate that CatFormer surpasses state-of-the-art methods. Moreover, we extend the usage of CatFormer to instance-level object pose estimation on the LINEMOD dataset, as well as object pose estimation in real-world scenarios. The experimental results validate the effectiveness and generalization capabilities of CatFormer. Our code and the supplemental materials are avaliable at https://github.com/BIT-robot-group/CatFormer.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

CatFormer: Category-Level 6D Object Pose Estimation with Transformer

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Similar Papers

Pose estimation algorithm based on point pair features using PointNet + +
Yifan Chen ... Mingyue Zhang
Complex & Intelligent Systems | VOL. 10
Yifan Chen, et. al.Yifan Chen ... Mingyue Zhang
16 Jun 2024
Complex & Intelligent Systems | VOL. 10

3D driver pose estimation based on joint 2D–3D network
Zhijie Yao ... Shengmei Shen
IET Computer Vision | VOL. 14
Zhijie Yao, et. al.Zhijie Yao ... Shengmei Shen
29 Jan 2020
IET Computer Vision | VOL. 14

In-bed human pose estimation using multi-source information fusion for health monitoring in real-world scenarios
Yean Zhu ... Lang Shuai
Information Fusion | VOL. 105
Yean Zhu, et. al.Yean Zhu ... Lang Shuai
29 Dec 2023
Information Fusion | VOL. 105

Real-Time and Efficient 6-D Pose Estimation From a Single RGB Image
Jun Cheng ... Fei Wang
IEEE Transactions on Instrumentation and Measurement | VOL. 70
Jun Cheng, et. al.Jun Cheng ... Fei Wang
01 Jan 2020
IEEE Transactions on Instrumentation and Measurement | VOL. 70

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

CatFormer: Category-Level 6D Object Pose Estimation with Transformer

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence