Abstract
Graph convolutional networks (GCNs) have attracted increasing interest in action recognition in recent years. A GCN models human skeleton sequences as spatio-temporal graphs, and attention mechanisms are often used jointly with GCNs to highlight important frames or body joints in a sequence. However, attention modules learn their parameters offline and keep them fixed, so they may not adapt well to unseen samples. In this paper, we propose a simple but effective motion-driven spatial and temporal adaptation strategy that dynamically strengthens the features of important frames and joints for skeleton-based action recognition. The rationale is that joints and frames with dramatic motions are generally more informative and discriminative. We combine the spatial and temporal refinements in a two-branch structure, in which the joint-wise and frame-wise feature refinements are performed in parallel; this structure encourages the two branches to learn more complementary feature representations. Moreover, we propose to use fully connected graph convolution to learn long-range spatial dependencies. In addition, we investigate two high-resolution skeleton graphs created by adding virtual joints, aiming to improve the representation of skeleton features. Combining the above proposals, we develop a novel motion-driven spatial and temporal adaptive high-resolution GCN. Experimental results demonstrate that the proposed model achieves state-of-the-art (SOTA) results on the challenging large-scale Kinetics-Skeleton and UAV-Human datasets, and that it is on par with SOTA methods on the NTU RGB+D 60 and 120 datasets. Additionally, our motion-driven adaptation method shows encouraging performance compared with attention mechanisms.
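To make the motion-driven adaptation idea concrete, the sketch below shows one plausible way to reweight skeleton features by motion magnitude: frames and joints that move more receive larger weights, and the spatial and temporal refinements run as two parallel branches. This is an illustrative assumption-based sketch, not the authors' exact formulation; the tensor layout (N, C, T, V), the normalization, and the summation-based fusion are our own choices for illustration.

import torch
import torch.nn.functional as F

def motion_driven_reweight(x, eps=1e-6):
    """Illustrative sketch (not the paper's exact method) of motion-driven
    spatial/temporal feature reweighting.

    x: skeleton features of shape (N, C, T, V)
       N = batch, C = channels, T = frames, V = joints.
    Joints and frames with larger motion receive larger weights.
    """
    # Frame-to-frame motion: difference along the temporal axis.
    motion = x[:, :, 1:, :] - x[:, :, :-1, :]            # (N, C, T-1, V)
    motion = F.pad(motion, (0, 0, 1, 0))                  # pad back to T frames
    mag = motion.abs().mean(dim=1, keepdim=True)          # (N, 1, T, V)

    # Spatial branch: per-joint weights (average motion over frames),
    # normalized so the mean weight is 1.
    joint_w = mag.mean(dim=2, keepdim=True)               # (N, 1, 1, V)
    joint_w = joint_w / (joint_w.sum(dim=-1, keepdim=True) + eps) * joint_w.shape[-1]

    # Temporal branch: per-frame weights (average motion over joints).
    frame_w = mag.mean(dim=3, keepdim=True)               # (N, 1, T, 1)
    frame_w = frame_w / (frame_w.sum(dim=2, keepdim=True) + eps) * frame_w.shape[2]

    # Two parallel refinements, combined here by summation (one possible fusion).
    return x * joint_w + x * frame_w

# Example: 2 samples, 64 channels, 300 frames, 25 joints (NTU-style skeleton).
x = torch.randn(2, 64, 300, 25)
y = motion_driven_reweight(x)
print(y.shape)  # torch.Size([2, 64, 300, 25])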