PANet: A Pixel-Level Attention Network for 6D Pose Estimation With Embedding Vector Features

Tao Xie,Lijun Zhao,Ruifeng Li,Xinyue Tang,Ke Wang

doi:10.1109/lra.2021.3136873

Tao Xie, Lijun Zhao + Show 3 more

Open Access

https://doi.org/10.1109/lra.2021.3136873

Copy DOI

Abstract

In this work, we present PANet, a pixel-level attention network with embedding vector features, which addresses the challenge of 6D pose estimation from a single RGBD image under severe occlusion. PANet produces pixel-wise attention for strong representation learning and leverages a novel selection scheme for robust pose estimation. Specifically, at the representation learning stage, we devise Pyramid Pixel-level Attention Module that unites attention mechanism with spatial pyramid to learn a discriminative representation, and Attention Upsample Module that utilizes arbitrary combinations of the CNN encoders’ feature maps to recover precise pixel-wise prediction, after which we embed the two modules into CNN to gain rich appearance features from RGB images. For depth images, we apply the current advanced point cloud network adopting attention mechanism to earn geometry features, which are further fused with the appearance features to obtain point-wise dense feature embedding. In the pose estimation stage, we define point-wise embedding vector features which can provide rich viewpoint information to better cope with the case of occluded objects. Further, a novel and effective RANSAC-based Selection Scheme is also founded to select vector features with high scores for pose estimation. Extensive experimental results manifest that our method outperforms the state-of-the-art by large margins on several benchmarks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Robotics and Automation Letters	Publication Date: Apr 1, 2022
Citations: 9	License type: publisher-specific, author manuscript

R Discovery Prime

R Discovery Prime

PANet: A Pixel-Level Attention Network for 6D Pose Estimation With Embedding Vector Features

Abstract

Talk to us

Similar Papers

More From: IEEE Robotics and Automation Letters

Lead the way for us

Similar Papers

FFB6D: A Full Flow Bidirectional Fusion Network for 6D Pose Estimation
Yisheng He ... Jian Sun
-
Yisheng He, et. al.Yisheng He ... Jian Sun
01 Jun 2021
01 Jun 2021

CSA6D: Channel-Spatial Attention Networks for 6D Object Pose Estimation
Tao Chen ... Dongbing Gu
Cognitive Computation | VOL. 14
Tao Chen, et. al.Tao Chen ... Dongbing Gu
29 Nov 2021
Cognitive Computation | VOL. 14

6DoF Pose Estimation of Transparent Object from a Single RGB-D Image.
Chi Xu ... Lijun Zhang
Sensors (Basel, Switzerland) | VOL. 20
Chi Xu, et. al.Chi Xu ... Lijun Zhang
27 Nov 2020
Sensors (Basel, Switzerland) | VOL. 20

NormalGAN: Learning Detailed 3D Human from a Single RGB-D Image
Lizhen Wang ... Tao Yu
-
Lizhen Wang, et. al.Lizhen Wang ... Tao Yu
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

PANet: A Pixel-Level Attention Network for 6D Pose Estimation With Embedding Vector Features

Abstract

Talk to us

Similar Papers

More From: IEEE Robotics and Automation Letters