MPCTrans: Multi-Perspective Cue-Aware Joint Relationship Representation for 3D Hand Pose Estimation via Swin Transformer.

Xiangan Wan,Jianping Ju,Jianying Tang,Mingyu Lin,Ning Rao,Deng Chen,Tingting Liu,Jing Li,Fan Bian,Nicholas Xiong

doi:10.3390/s24217029

Abstract

The objective of 3D hand pose estimation (HPE) based on depth images is to accurately locate and predict keypoints of the hand. However, this task remains challenging because of the variations in hand appearance from different viewpoints and severe occlusions. To effectively address these challenges, this study introduces a novel approach, called the multi-perspective cue-aware joint relationship representation for 3D HPE via the Swin Transformer (MPCTrans, for short). This approach is designed to learn multi-perspective cues and essential information from hand depth images. To achieve this goal, three novel modules are proposed to utilize features from multiple virtual views of the hand, namely, the adaptive virtual multi-viewpoint (AVM), hierarchy feature estimation (HFE), and virtual viewpoint evaluation (VVE) modules. The AVM module adaptively adjusts the angles of the virtual viewpoint and learns the ideal virtual viewpoint to generate informative multiple virtual views. The HFE module estimates hand keypoints through hierarchical feature extraction. The VVE module evaluates virtual viewpoints by using chained high-level functions from the HFE module. Transformer is used as a backbone to extract the long-range semantic joint relationships in hand depth images. Extensive experiments demonstrate that the MPCTrans model achieves state-of-the-art performance on four challenging benchmark datasets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

MPCTrans: Multi-Perspective Cue-Aware Joint Relationship Representation for 3D Hand Pose Estimation via Swin Transformer.

Abstract

Talk to us

Similar Papers

More From: Sensors (Basel, Switzerland)

Lead the way for us

Journal: Sensors (Basel, Switzerland)	Publication Date: Oct 31, 2024
License type: cc-by

Similar Papers

Real Time 3D Pose Estimation of Both Human Hands via RGB-Depth Camera and Deep Convolutional Neural Networks
Geon Gi ... Hye Min Park
-
Geon Gi, et. al.Geon Gi ... Hye Min Park
06 Jun 2019
06 Jun 2019

3D hand pose and mesh estimation via a generic Topology-aware Transformer model.
Shaoqi Yu ... Yintong Wang
Frontiers in Neurorobotics | VOL. 18
Shaoqi Yu, et. al.Shaoqi Yu ... Yintong Wang
03 May 2024
Frontiers in Neurorobotics | VOL. 18

Robust 3D Hand Pose Estimation From Single Depth Images Using Multi-View CNNs.
Liuhao Ge ... Junsong Yuan
IEEE Transactions on Image Processing | VOL. 27
Liuhao Ge, et. al.Liuhao Ge ... Junsong Yuan
01 Sep 2018
IEEE Transactions on Image Processing | VOL. 27

SO-HandNet: Self-Organizing Network for 3D Hand Pose Estimation With Semi-Supervised Learning
Yujin Chen ... Liuhao Ge
-
Yujin Chen, et. al.Yujin Chen ... Liuhao Ge
01 Oct 2019
01 Oct 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

MPCTrans: Multi-Perspective Cue-Aware Joint Relationship Representation for 3D Hand Pose Estimation via Swin Transformer.

Abstract

Talk to us

Similar Papers

More From: Sensors (Basel, Switzerland)