An Efficient Graph Convolution Network for Skeleton-Based Dynamic Hand Gesture Recognition

Sheng-Hui Peng,Pei-Hsuan Tsai

doi:10.1109/tcds.2023.3242988

Abstract

Dynamic hand gesture recognition has evolved as a prominent topic of computer vision research due to its vast applications in human-computer interaction, robotics, and other domains. Although there are numerous related recognition studies, the State-Of-The-Art (SOTA) methods are overparametrized. Specifically, the number of model parameters is quite large, which results in high computational costs. This work, referring to Songs ResGCN, designs an efficient and lightweight graph convolutional network, named ResGCNeXt. ResGCNeXt learns rich features from skeleton information and achieves high accuracy with less number of model parameters. First, three data preprocessing strategies according to motion analysis are designed to provide sufficient features for the recognition model. Then, an efficient graph convolutional network structure combining bottleneck and group convolution is designed to reduce the number of model parameters without loss of accuracy. Furthermore, an attention block called SENet-Part attention (SEPA) is added to improve channel and spatial feature learning. This study is validated on two benchmark datasets, and the experimental results show that ResGCNeXt provides competitive performance, especially in significantly reducing the number of model parameters. Compared to HAN-2S, which is one of the best SOTA methods, our method has half model parameters and a 0.3% higher recognition rate.

Full Text