InterNet+: A Light Network for Hand Pose Estimation.

Yang Liu,Jie Jiang,Xianghan Wang,Jiahao Sun

doi:10.3390/s21206747

Yang Liu, Jie Jiang + Show 2 more

Open Access

https://doi.org/10.3390/s21206747

Copy DOI

Abstract

Hand pose estimation from RGB images has always been a difficult task, owing to the incompleteness of the depth information. Moon et al. improved the accuracy of hand pose estimation by using a new network, InterNet, through their unique design. Still, the network still has potential for improvement. Based on the architecture of MobileNet v3 and MoGA, we redesigned a feature extractor that introduced the latest achievements in the field of computer vision, such as the ACON activation function and the new attention mechanism module, etc. Using these modules effectively with our network, architecture can better extract global features from an RGB image of the hand, leading to a greater performance improvement compared to InterNet and other similar networks.

Highlights

We evaluated our InterNet+ network on RGB datasets used for hand pose estimation, including the stereo hand pose tracking benchmark (STB) and Rendered Handpose Dataset (RHD) datasets, and the feasibility test conducted on the incomplete
We use widely used mean end point error (EPE, according to reference [3], which is defined as a mean Euclidean distance between the predicted and ground-truth 3D hand pose after root joint alignment) as the evaluation metrics for STB dataset and RHD dataset
Considering experience, the best result is generally obtained near the epoch where the learning rate is about to converge to 0; the STB result is taken from 45 epochs, and the RHD training result is taken from 189 epochs

Summary

Introduction

Using deep learning methods to estimate hand pose based on a RGB image, one of the possible methods is InterNet, which was introduced by [8]. InterNet accurately estimates the posture position of the hand by inputting an annotated RGB image using a deep neural network feature extractor and subsequent heatmap estimation and position-fitting with the fully connected network. For the purpose of achieving the potential of the original InterNet structure and verify whether the current achievements can be effectively applied to the field of hand pose estimation, as well as improving the performance in multiple datasets, we relied on the recent developments in this field to update the original method and achieved greater improvements on multiple datasets.

Related Work

Original InterNet

Network Structure

Specific

Redesigned Feature Extraction Network

Inverted Residual Block

Coordinate

Feature

Processing of the Feature Maps

Effective Way of Network Training

Experiment

Datasets

STB Dataset

RHD Dataset

Experimental Environment and Results

Methods

Convergence

Coordinate Attention Mechanism Module

Processing of Feature Map by Using the FcaNet Layer

Discussion and Outlook

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Sensors (Basel, Switzerland)	Publication Date: Oct 11, 2021
Citations: 4	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

InterNet+: A Light Network for Hand Pose Estimation.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Sensors (Basel, Switzerland)

Lead the way for us

Similar Papers

3D Hand Pose Estimation from RGB Using Privileged Learning with Depth Data
Bjorn Stenger ... Shanxin Yuan
-
Bjorn Stenger, et. al.Bjorn Stenger ... Shanxin Yuan
01 Oct 2019
01 Oct 2019

Hand pose estimation based on improved NSRM network
Jinhua Wang ... Duo He
EURASIP Journal on Advances in Signal Processing | VOL. 2023
Jinhua Wang, et. al.Jinhua Wang ... Duo He
12 Jan 2023
EURASIP Journal on Advances in Signal Processing | VOL. 2023

CrossFuNet: RGB and Depth Cross-Fusion Network for Hand Pose Estimation.
Yan Ma ... Qian Zhang
Sensors (Basel, Switzerland) | VOL. 21
Yan Ma, et. al.Yan Ma ... Qian Zhang
11 Sep 2021
Sensors (Basel, Switzerland) | VOL. 21

Graph-Based CNNs With Self-Supervised Module for 3D Hand Pose Estimation From Monocular RGB
Xinghui Dong ... Haiyan Li
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 31
Xinghui Dong, et. al.Xinghui Dong ... Haiyan Li
23 Jun 2020
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 31

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

InterNet+: A Light Network for Hand Pose Estimation.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Sensors (Basel, Switzerland)