Abstract

Dynamic hand gesture recognition has evolved as a prominent topic of computer vision research due to its vast applications in human-computer interaction, robotics, and other domains. Although there are numerous related recognition studies, the State-Of-The-Art (SOTA) methods are overparametrized. Specifically, the number of model parameters is quite large, which results in high computational costs. This work, referring to Songs ResGCN, designs an efficient and lightweight graph convolutional network, named ResGCNeXt. ResGCNeXt learns rich features from skeleton information and achieves high accuracy with less number of model parameters. First, three data preprocessing strategies according to motion analysis are designed to provide sufficient features for the recognition model. Then, an efficient graph convolutional network structure combining bottleneck and group convolution is designed to reduce the number of model parameters without loss of accuracy. Furthermore, an attention block called SENet-Part attention (SEPA) is added to improve channel and spatial feature learning. This study is validated on two benchmark datasets, and the experimental results show that ResGCNeXt provides competitive performance, especially in significantly reducing the number of model parameters. Compared to HAN-2S, which is one of the best SOTA methods, our method has half model parameters and a 0.3% higher recognition rate.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.