CFAM: Estimating 3D Hand Poses from a Single RGB Image with Attention

Xianghan Wang, Lai Kang, Jie Jiang, Yanming Guo, Dan Li, Yingmei Wei

doi:10.3390/app10020618

Abstract

Precise 3D hand pose estimation can improve the performance of human–computer interaction (HCI), and computer-vision-based hand pose estimation in particular can make this interaction more natural. Most traditional computer-vision-based methods take depth images as input, which requires complicated and expensive acquisition equipment. Estimation from a single RGB image is more convenient and less expensive. Previous RGB-based methods use only 2D keypoint score maps to recover 3D hand poses and ignore the hand texture features and the underlying spatial information in the RGB image, which leads to relatively low accuracy. To address this issue, we propose a channel fusion attention mechanism that combines 2D keypoint features and RGB image features at the channel level. In particular, the proposed method concatenates the RGB image and the 2D keypoint features and reassigns the weights of the resulting channels, so that the different feature types are weighted and used appropriately. Moreover, our method improves the fusion performance of different types of feature maps. Comparative experiments on public datasets demonstrate that the accuracy of the proposed method is comparable to the state of the art.
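The channel-level fusion described in the abstract can be illustrated with a small sketch. This is not the authors' architecture, which this page does not specify. It is a generic squeeze-and-excitation-style channel attention, assumed here as a stand-in: the RGB feature channels and the 2D keypoint score maps are concatenated, each channel is pooled to one value, a small two-layer network produces one gate per channel, and every channel is rescaled by its gate. All function names, the channel counts (3 and 21), the hidden size, and the random weights are hypothetical.

```python
import math
import random


def global_avg_pool(channels):
    """Squeeze step: reduce every H x W channel to its mean value."""
    return [sum(map(sum, ch)) / (len(ch) * len(ch[0])) for ch in channels]


def dense(vec, weights, bias):
    """Fully connected layer; `weights` holds one row per output unit."""
    return [sum(w * v for w, v in zip(row, vec)) + b
            for row, b in zip(weights, bias)]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def channel_fusion_attention(rgb_feats, keypoint_maps, w1, b1, w2, b2):
    """Hypothetical SE-style fusion, not the layer from the paper."""
    # Concatenate the two feature types along the channel axis.
    fused = rgb_feats + keypoint_maps
    # Squeeze, then a ReLU bottleneck, then one sigmoid gate per channel.
    hidden = [max(0.0, h) for h in dense(global_avg_pool(fused), w1, b1)]
    gates = [sigmoid(g) for g in dense(hidden, w2, b2)]
    # Rescale every channel by its gate.
    reweighted = [[[g * px for px in row] for row in ch]
                  for g, ch in zip(gates, fused)]
    return reweighted, gates


def random_maps(rng, channels, height, width):
    """Random stand-in feature maps, used only for this demonstration."""
    return [[[rng.random() for _ in range(width)] for _ in range(height)]
            for _ in range(channels)]


rng = random.Random(0)
rgb_feats = random_maps(rng, 3, 4, 4)       # assumed: 3 RGB feature channels
keypoint_maps = random_maps(rng, 21, 4, 4)  # assumed: one score map per keypoint
n_channels, n_hidden = 24, 6                # assumed sizes
w1 = [[rng.uniform(-1, 1) for _ in range(n_channels)] for _ in range(n_hidden)]
b1 = [0.0] * n_hidden
w2 = [[rng.uniform(-1, 1) for _ in range(n_hidden)] for _ in range(n_channels)]
b2 = [0.0] * n_channels

reweighted, gates = channel_fusion_attention(
    rgb_feats, keypoint_maps, w1, b1, w2, b2)
print(len(gates), len(reweighted), len(reweighted[0]), len(reweighted[0][0]))
```

In a trained network the two weight matrices would be learned, so the gates would express how much each RGB or keypoint channel contributes to the 3D pose estimate. Here they are random and only demonstrate the data flow.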

Highlights

Gesture estimation plays a significant role in computer science, and related tasks aim toward understanding human gestures through algorithms.
We introduce estimation methods based on depth images and RGB images in this chapter.
Based on 3D hand estimation from a single RGB image, we propose a method that uses the attention mechanism to fuse the 2D score map and the RGB image channel.

Summary

Introduction

Gesture estimation plays a significant role in computer science, and related tasks aim toward understanding human gestures through algorithms. Human–computer interaction (HCI) can be implemented anywhere and at any time, has fewer constraints, and enables computers to understand user commands efficiently and precisely without any mechanical assistance. Gestures for HCI are quick, vivid, intuitive, flexible, and visual; they enable soundless interaction and bridge the gap between the real world and virtual worlds. Computer-vision-based hand pose estimation enables people to communicate with machines more naturally. With the development of computer vision, pose estimation no longer relies on traditional wearable devices in specific scenes but can be implemented directly through image recognition. Research on pose estimation in computer vision falls into three main categories according to the input: depth images, multi-view RGB images, and single RGB images.



Journal: Applied Sciences
Publication Date: Jan 15, 2020
Citations: 6
License type: CC BY 4.0


Similar Papers

3D Hand Pose Estimation from RGB Using Privileged Learning with Depth Data
Shanxin Yuan ... Tae-Kyun Kim
01 Oct 2019

Graph-Based CNNs With Self-Supervised Module for 3D Hand Pose Estimation From Monocular RGB
Shaoxiang Guo ... Junyu Dong
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 31
23 Jun 2020

Real Time 3D Pose Estimation of Both Human Hands via RGB-Depth Camera and Deep Convolutional Neural Networks
Geon Gi ... Tae Yeon Kim
06 Jun 2019

Cascaded Hierarchical CNN for RGB-Based 3D Hand Pose Estimation
Shiming Dai ... Wenji Yang
Mathematical Problems in Engineering | VOL. 2020
15 Jul 2020
