Abstract

Video-based action recognition is challenging due to the rapid and uncertain changes in human actions. Recent studies show that incorporating human body skeleton data alongside video improves action recognition performance. These methods generally use graph convolutional networks (GCNs) to extract structural features of the human body joints from skeleton data. However, most GCN-based methods for skeleton-based action recognition share several limitations. (1) The graph structure of the human body joints is time-invariant, making it difficult to represent the changing relationships between joints across actions. (2) Single-stream methods utilize only limited information from skeleton data, such as joints or bones, and fail to capture coherent features of movement. (3) Multi-stream methods contain a large number of parameters and are therefore inefficient for real-life applications. To address these problems, we propose an adaptive spatiotemporal graph convolutional network with intermediate aggregation of multi-stream skeleton features for action recognition. First, our method learns an adaptive graph structure that represents the changing relationships between joints. Second, we employ a multi-stream model to extract complementary features from the skeleton, including a joint stream, a bone stream, and a motion stream. Moreover, an intermediate aggregation strategy fuses these features and reduces the model's parameter count. The proposed method has been validated on several benchmarks and a real-world abnormal action dataset. Extensive experimental results show that our method achieves excellent performance in skeleton-based action recognition.
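
To make the three ideas in the abstract concrete, the sketch below is a minimal PyTorch rendering of them, not the authors' implementation: all class and parameter names (AdaptiveGraphConv, STBlock, MultiStreamAGCN, num_joints, channels) are illustrative assumptions. It shows a learnable adjacency matrix in place of a fixed skeleton graph, three shallow stream-specific stems for the joint, bone, and motion inputs, and an intermediate summation of their features so that the deeper layers are shared rather than triplicated.

```python
# Minimal sketch (assumed PyTorch), illustrating the abstract's three ideas.
import torch
import torch.nn as nn

class AdaptiveGraphConv(nn.Module):
    """Spatial graph convolution with a learnable (adaptive) adjacency matrix."""
    def __init__(self, in_channels, out_channels, num_joints):
        super().__init__()
        # Learnable adjacency: updated during training, so joint relationships
        # are no longer fixed by the physical skeleton topology.
        self.A = nn.Parameter(
            torch.eye(num_joints) + 1e-3 * torch.randn(num_joints, num_joints))
        self.conv = nn.Conv2d(in_channels, out_channels, kernel_size=1)

    def forward(self, x):                              # x: (N, C, T, V)
        x = torch.einsum('nctv,vw->nctw', x, self.A)   # aggregate over joints
        return self.conv(x)

class STBlock(nn.Module):
    """One spatiotemporal block: adaptive spatial GCN + temporal convolution."""
    def __init__(self, in_channels, out_channels, num_joints):
        super().__init__()
        self.gcn = AdaptiveGraphConv(in_channels, out_channels, num_joints)
        self.tcn = nn.Conv2d(out_channels, out_channels,
                             kernel_size=(9, 1), padding=(4, 0))  # temporal conv
        self.relu = nn.ReLU()

    def forward(self, x):
        return self.relu(self.tcn(self.gcn(x)))

class MultiStreamAGCN(nn.Module):
    """Joint, bone, and motion streams with intermediate feature aggregation."""
    def __init__(self, num_joints=25, num_classes=60, channels=64):
        super().__init__()
        # One shallow stem per input stream.
        self.stems = nn.ModuleList(
            [STBlock(3, channels, num_joints) for _ in range(3)])
        # Shared deeper layers process the fused representation.
        self.shared = nn.Sequential(
            STBlock(channels, channels, num_joints),
            STBlock(channels, 2 * channels, num_joints))
        self.fc = nn.Linear(2 * channels, num_classes)

    def forward(self, joint, bone, motion):            # each: (N, 3, T, V)
        feats = [stem(s) for stem, s in zip(self.stems, (joint, bone, motion))]
        x = torch.stack(feats).sum(0)                  # intermediate aggregation
        x = self.shared(x)
        x = x.mean(dim=(2, 3))                         # global pool over T, V
        return self.fc(x)

# Bone and motion streams are simple transforms of the joint stream:
# bone[v] = joint[v] - joint[parent(v)];  motion[t] = joint[t+1] - joint[t].
```

Aggregating the streams after a shallow stem, rather than fusing softmax scores from three complete networks, is what keeps the parameter count low relative to conventional multi-stream models.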
