Abstract

Action recognition is of great significance in the field of machine vision. In recent years, skeleton-based action recognition models have made great progress, but little research has addressed the extraction of weak skeleton features, leading to insufficient generalization of the trained models. This work proposes to use the Transformer structure and its attention mechanism: skeleton features extracted via a GCN are fed into the Transformer, which captures the behavior they encode. Furthermore, the original ST-GCN model is optimized by introducing an adaptive graph convolutional layer to increase its flexibility, and by adding an attention mechanism in a separate spatiotemporal-channel module to further enhance the adaptive graph convolutional layer. Experiments on the NTU-RGBD dataset show that the model achieves some improvement in action-recognition accuracy.
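The abstract gives no implementation details, but the adaptive graph convolutional layer it describes is commonly formulated (as in 2s-AGCN) by combining a fixed skeleton adjacency with a learned global graph and a data-dependent graph. The following is a minimal sketch of that formulation only; the class name, the embed_channels parameter, and all variable names are illustrative assumptions, not the authors' code.

    # Sketch of an adaptive graph convolution layer (2s-AGCN-style).
    # All names here are assumptions for illustration.
    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class AdaptiveGraphConv(nn.Module):
        def __init__(self, in_channels, out_channels, A, embed_channels=16):
            super().__init__()
            # A: (V, V) normalized skeleton adjacency (fixed structural prior)
            self.register_buffer('A', A)
            # B: learned global graph, shared across all samples
            self.B = nn.Parameter(torch.zeros_like(A))
            # theta/phi embed joint features to build a sample-specific graph C
            self.theta = nn.Conv2d(in_channels, embed_channels, 1)
            self.phi = nn.Conv2d(in_channels, embed_channels, 1)
            self.conv = nn.Conv2d(in_channels, out_channels, 1)

        def forward(self, x):
            # x: (N, C, T, V) -- batch, channels, frames, joints
            N, C, T, V = x.shape
            # data-dependent graph C: softmax similarity of joint embeddings
            th = self.theta(x).permute(0, 3, 1, 2).reshape(N, V, -1)  # (N, V, C'*T)
            ph = self.phi(x).reshape(N, -1, V)                        # (N, C'*T, V)
            Cg = F.softmax(torch.bmm(th, ph), dim=-1)                 # (N, V, V)
            adj = self.A + self.B + Cg  # adaptive adjacency per sample
            # aggregate joint features over the graph, then 1x1 conv
            y = torch.einsum('nctv,nvw->nctw', x, adj)
            return self.conv(y)

For NTU-RGBD (25 joints), such a layer would be used as, e.g., AdaptiveGraphConv(3, 64, A) with A the (25, 25) normalized skeleton adjacency and input of shape (batch, 3, frames, 25).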
