Abstract

6D pose estimation is a critical technology that enables robots to perceive and interact with their operating environment. However, occlusion causes a loss of local features, which in turn limits estimation accuracy. To address this challenge, this paper proposes an end-to-end pose-estimation network built on a multi-channel attention mechanism. First, a multi-channel attention mechanism, designated DA2Net, is devised using A2-Nets as its foundation. The mechanism operates in two steps: in the first step, key features are extracted from the global feature space through second-order attention pooling; in the second step, a feature map is generated by integrating position and channel attention, and the extracted key features are then assigned to each position of that feature map, enhancing both the feature representation capacity and overall performance. Second, the designed attention mechanism is introduced into both the feature-fusion and pose-iterative-refinement networks to strengthen the network's ability to capture local features, thereby improving its overall performance. Experimental results show that the estimation accuracy of the resulting network, DenseFusion-DA2, on the LineMOD dataset is approximately 3.4% higher than that of DenseFusion, and surpasses PoseCNN, PVNet, SSD6D, and PointFusion by 8.3%, 11.1%, 20.3%, and 23.8%, respectively. DenseFusion-DA2 also shows a clear advantage on the Occluded LineMOD and HR-Vision datasets. This work not only presents a more effective solution for robot perception but also offers new ideas and methods for technological advances and applications in related fields.
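For readers unfamiliar with the gather-and-distribute structure the abstract describes, the sketch below illustrates the two steps on top of the published A2-Nets formulation: second-order attention pooling to gather global key features, followed by per-position distribution. It is a minimal illustration, not the authors' implementation; the module name, the descriptor channel counts `c_m`/`c_n`, and the residual connection are assumptions, and DA2Net's additional integration of position and channel attention is not reproduced here.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class DoubleAttention(nn.Module):
    """Minimal sketch of the A2-Nets double-attention block that DA2Net
    builds on. Step 1 gathers global key features via second-order
    attention pooling; step 2 distributes them back to every spatial
    position. DA2Net's fusion of position and channel attention is
    not reproduced here (assumption: details follow the paper).
    """

    def __init__(self, in_channels: int, c_m: int, c_n: int):
        super().__init__()
        self.conv_a = nn.Conv2d(in_channels, c_m, kernel_size=1)   # feature maps A
        self.conv_b = nn.Conv2d(in_channels, c_n, kernel_size=1)   # gather attention B
        self.conv_v = nn.Conv2d(in_channels, c_n, kernel_size=1)   # distribute attention V
        self.conv_out = nn.Conv2d(c_m, in_channels, kernel_size=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, _, h, w = x.shape
        A = self.conv_a(x).view(b, -1, h * w)                       # (b, c_m, hw)
        B = F.softmax(self.conv_b(x).view(b, -1, h * w), dim=-1)    # softmax over positions
        V = F.softmax(self.conv_v(x).view(b, -1, h * w), dim=1)     # softmax over c_n

        # Step 1: second-order attention pooling -> global descriptors G
        G = torch.bmm(A, B.transpose(1, 2))                         # (b, c_m, c_n)

        # Step 2: distribute the gathered key features to each position
        Z = torch.bmm(G, V).view(b, -1, h, w)                       # (b, c_m, h, w)
        return x + self.conv_out(Z)                                 # residual connection


# Usage: the block preserves the input shape, so it can be dropped into
# an existing backbone.
block = DoubleAttention(in_channels=64, c_m=32, c_n=16)
out = block(torch.randn(2, 64, 30, 40))                             # (2, 64, 30, 40)
```

Because the block is shape-preserving, a module like this could plausibly be inserted into DenseFusion's feature-fusion and iterative-refinement stages, letting occluded regions draw on gathered global context; the exact insertion points are described in the paper itself.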
