Abstract

Graph Convolutional Networks (GCNs) have attracted considerable attention and shown remarkable performance for action recognition in recent years. To improve recognition accuracy, the key problems for this class of methods are how to build the graph structure adaptively, how to select key frames, and how to extract discriminative features. In this work, we propose novel Adaptive Attention Memory Graph Convolutional Networks (AAM-GCN) for human action recognition using skeleton data. We adopt a GCN to adaptively model the spatial configuration of skeletons and employ a Gated Recurrent Unit (GRU) to construct an attention-enhanced memory for capturing temporal features. With the memory module, our model can not only remember what happened in the past but also exploit future information through multiple bidirectional GRU layers. Furthermore, to extract discriminative temporal features, an attention mechanism is employed to select key frames from the skeleton sequence. Extensive experiments on the Kinetics, NTU RGB+D and HDM05 datasets show that the proposed network achieves better performance than several state-of-the-art methods.
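As a minimal sketch of what "adaptively model the spatial configuration" can mean in code, the layer below combines a fixed skeletal adjacency with a learnable offset so that latent joint dependencies can emerge during training. The class name `AdaptiveGraphConv`, the layer sizes, and the specific parameterization are illustrative assumptions, not the paper's exact design.

```python
# Illustrative adaptive graph convolution: a fixed physical adjacency A plus
# a learnable offset B, so dependencies between unconnected joints can be
# learned during training. Not the paper's exact layer.
import torch
import torch.nn as nn

class AdaptiveGraphConv(nn.Module):
    def __init__(self, in_ch, out_ch, num_joints, A_skeleton):
        super().__init__()
        self.register_buffer("A", A_skeleton)                       # fixed physical graph
        self.B = nn.Parameter(torch.zeros(num_joints, num_joints))  # learned offset
        self.proj = nn.Linear(in_ch, out_ch)

    def forward(self, x):                               # x: (batch, T, N, in_ch)
        A_adapt = self.A + self.B                       # adapted adjacency
        x = torch.einsum("uv,btvc->btuc", A_adapt, x)   # aggregate over neighbor joints
        return self.proj(x)

# Usage with an identity adjacency as a stand-in for a real skeleton graph:
layer = AdaptiveGraphConv(3, 16, num_joints=25, A_skeleton=torch.eye(25))
out = layer(torch.randn(2, 30, 25, 3))                  # -> (2, 30, 25, 16)
```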

Highlights

  • Research on human action recognition has become one of the most active topics in computer vision in recent years

  • In GCN-based action recognition works [18,20], the dynamics of a human skeleton sequence with N joints and T frames are denoted as a spatial-temporal graph G = (V, E); see the code sketch after this list

  • The results demonstrate that information from both the past and the future is helpful for action recognition
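To make the graph definition above concrete, here is a minimal sketch of building and normalizing the intra-frame adjacency of such a spatial-temporal graph. The 5-joint edge list is illustrative, not any dataset's actual skeleton layout.

```python
# Minimal sketch of the spatial part of the graph G = (V, E) over N joints.
import numpy as np

N = 5  # number of joints per frame (illustrative)
T = 3  # number of frames (illustrative)

# Intra-frame (spatial) edges: physical bone connections.
spatial_edges = [(0, 1), (1, 2), (1, 3), (1, 4)]

A = np.zeros((N, N))
for i, j in spatial_edges:
    A[i, j] = A[j, i] = 1.0
A += np.eye(N)  # self-loops so each joint keeps its own features

# Symmetric normalization A_hat = D^{-1/2} A D^{-1/2}, as commonly used in GCNs.
D_inv_sqrt = np.diag(1.0 / np.sqrt(A.sum(axis=1)))
A_hat = D_inv_sqrt @ A @ D_inv_sqrt

# Inter-frame (temporal) edges connect the same joint across consecutive
# frames; in practice this is handled by a temporal module (e.g. a GRU)
# rather than an explicit (N*T) x (N*T) adjacency.
```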


Summary

Introduction

Research on human action recognition has become one of the most active topics in computer vision in recent years. Our method employs graph convolution to adaptively construct the spatial configuration within each frame and uses multiple bidirectional GRU layers to extract temporal information. The advantages of the proposed method are as follows. First, the constructed adaptive graph can effectively capture latent dependencies between arbitrary joints, including joints that have no physical connection but are strongly correlated in an action; this better matches real actions, which require the collaboration of different body parts. Second, the proposed AAM-GCN network models dynamic skeletons for action recognition by constructing the graph structure adaptively during training and explicitly exploring the latent dependencies among joints. Third, by constructing an attention-enhanced memory, AAM-GCN can selectively focus on key frames and capture long-range discriminative temporal features from both the past and the future. Finally, we conduct an ablation study to demonstrate the effectiveness of each individual part of our model
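The sketch below illustrates the attention-enhanced memory idea: a bidirectional GRU reads per-frame skeleton features (so the model sees both past and future context), and a soft attention over frames selects key frames before classification. This is a minimal illustration rather than the authors' exact AAM-GCN; the class name `AttentionMemory`, the layer sizes, and the attention form are our assumptions.

```python
# Illustrative attention-enhanced memory over per-frame skeleton features:
# a 2-layer bidirectional GRU provides past and future context, and a soft
# attention over frames weights key frames before classification.
import torch
import torch.nn as nn

class AttentionMemory(nn.Module):
    def __init__(self, feat_dim=64, hidden=128, num_classes=60):
        super().__init__()
        self.gru = nn.GRU(feat_dim, hidden, num_layers=2,
                          batch_first=True, bidirectional=True)
        self.attn = nn.Linear(2 * hidden, 1)          # one score per frame
        self.fc = nn.Linear(2 * hidden, num_classes)

    def forward(self, x):                             # x: (batch, T, feat_dim)
        h, _ = self.gru(x)                            # (batch, T, 2*hidden)
        scores = torch.softmax(self.attn(h), dim=1)   # frame attention weights
        context = (scores * h).sum(dim=1)             # attention-weighted memory
        return self.fc(context)

# Usage: per-frame features from the spatial GCN, here random for illustration.
feats = torch.randn(8, 30, 64)                        # batch of 8, 30 frames
logits = AttentionMemory()(feats)                     # -> (8, 60)
```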

Related Works
Graph Convolutional Networks
Illustration
Adaptive
Attention
Model Architecture and Training Details
Experiments
Datasets
Comparisons with the State-of-the-Art Methods
Visualization of the Actions
Effect of Adaptive Graph
Effect of Bidirectional GRU
Comparison with Different Configurations
Effect of ASGC Concatenation
Other Parameters Evaluation
Findings
Conclusions and Future Work