DON6D: a decoupled one-stage network for 6D pose estimation

Zheng Wang,Hangyao Tu,Yutong Qian,Yanwei Zhao

doi:10.1038/s41598-024-59152-x

Abstract

The six-dimensional (6D) pose object estimation is a key task in robotic manipulation and grasping scenes. Many existing two-stage solutions with a slow inference speed require extra refinement to handle the challenges of variations in lighting, sensor noise, object occlusion, and truncation. To address these challenges, this work proposes a decoupled one-stage network (DON6D) model for 6D pose estimation that improves inference speed on the premise of maintaining accuracy. Particularly, since the RGB images are aligned with the RGB-D images, the proposed DON6D first uses a two-dimensional detection network to locate the interested objects in RGB-D images. Then, a module of feature extraction and fusion is used to extract color and geometric features fully. Further, dual data augmentation is performed to enhance the generalization ability of the proposed model. Finally, the features are fused, and an attention residual encoder–decoder, which can improve the pose estimation performance to obtain an accurate 6D pose, is introduced. The proposed DON6D model is evaluated on the LINEMOD and YCB-Video datasets. The results demonstrate that the proposed DON6D is superior to several state-of-the-art methods regarding the ADD(-S) and ADD(-S) AUC metrics.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

DON6D: a decoupled one-stage network for 6D pose estimation

Abstract

Talk to us

Similar Papers

More From: Scientific Reports

Lead the way for us

Journal: Scientific Reports	Publication Date: Apr 10, 2024
License type: CC BY 4.0

Similar Papers

A 3D object detection and pose estimation pipeline using RGB-D images
Ruotao He ... Juan Rojas
-
Ruotao He, et. al.Ruotao He ... Juan Rojas
01 Dec 2017
01 Dec 2017

3D hand pose and shape estimation from RGB images for keypoint-based hand gesture recognition
Danilo Avola ... Daniele Pannone
Pattern Recognition | VOL. 129
Danilo Avola, et. al.Danilo Avola ... Daniele Pannone
30 Apr 2022
Pattern Recognition | VOL. 129

CSA6D: Channel-Spatial Attention Networks for 6D Object Pose Estimation
Tao Chen ... Dongbing Gu
Cognitive Computation | VOL. 14
Tao Chen, et. al.Tao Chen ... Dongbing Gu
29 Nov 2021
Cognitive Computation | VOL. 14

A dual-source approach for 3D human pose estimation from single images
Umar Iqbal ... Juergen Gall
Computer Vision and Image Understanding | VOL. 172
Umar Iqbal, et. al.Umar Iqbal ... Juergen Gall
04 Apr 2018
Computer Vision and Image Understanding | VOL. 172

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

DON6D: a decoupled one-stage network for 6D pose estimation

Abstract

Talk to us

Similar Papers

More From: Scientific Reports