Abstract

Traditional convolutional neural networks have achieved great success in human action recognition. However, it remains challenging to establish effective associations between different skeleton joints to capture detailed information. In this paper, we propose a dual attention-guided multiscale dynamic aggregate graph convolutional neural network (DAG-GCN) for skeleton-based human action recognition. Our goal is to explore the best correlations and extract high-level semantic features. First, a multiscale dynamic aggregate GCN module captures important semantic information and establishes dependence relationships among different skeleton joints. Second, the higher-level semantic features are further refined, and their semantic relevance is emphasized, through a dual attention guidance module. In addition, the two modules jointly exploit the hierarchical relationships of joints and the spatial-temporal correlations. The DAG-GCN method achieves strong performance on the NTU-60 RGB+D and NTU-120 RGB+D datasets, with accuracies of 95.76% and 90.01% on the cross-view (X-View) and cross-subject (X-Sub) benchmarks of the NTU-60 dataset, respectively.
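The dual attention guidance described above can be illustrated with a minimal sketch. This is an assumption-laden toy in NumPy, not the paper's actual layer: it re-weights a skeleton feature map along the joint (spatial) dimension and the channel dimension via self-similarity attention, then fuses both with a residual connection.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def dual_attention(feat):
    """Illustrative dual (joint + channel) attention.

    feat: array of shape (C, T, V) -- channels, frames, joints.
    Returns a re-weighted feature map of the same shape.
    NOTE: shapes and the residual fusion are assumptions for illustration.
    """
    C, T, V = feat.shape
    # Joint (spatial) attention: joint-to-joint similarity, averaged over time.
    x = feat.mean(axis=1)                                 # (C, V)
    joint_att = softmax(x.T @ x / np.sqrt(C), axis=-1)    # (V, V)
    spatial_out = np.einsum('ctv,vw->ctw', feat, joint_att)
    # Channel attention: channel-to-channel similarity over all positions.
    y = feat.reshape(C, -1)                               # (C, T*V)
    chan_att = softmax(y @ y.T / np.sqrt(y.shape[1]), axis=-1)  # (C, C)
    channel_out = np.einsum('cd,dtv->ctv', chan_att, feat)
    # Residual fusion of both attention branches.
    return feat + spatial_out + channel_out

feat = np.random.rand(8, 4, 25)  # 8 channels, 4 frames, 25 joints (NTU skeleton)
out = dual_attention(feat)
```

The two branches here mirror the common position-attention/channel-attention split; the paper's exact guidance mechanism may differ.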

Highlights

  • Human action recognition is widely used in many scenarios, such as human-computer interaction [1], video retrieval [2], and medical treatment security [3]

  • We propose a dual attention-guided multiscale dynamic aggregate graph convolutional network (DAG-GCN) framework for skeleton-based human action recognition, and we provide the experimental results and analysis

  • We present the results of ablation experiments on multiscale dynamic aggregate operations and show the efficiency of the DAG-GCN recognition framework


Summary

Introduction

Human action recognition is widely used in many scenarios, such as human-computer interaction [1], video retrieval [2], and medical treatment security [3]. With the development of deep learning technology, human skeleton action recognition based on joint type, frame index, and 3D position has been widely studied. Compared with RGB action video, skeleton data are more robust and computationally efficient. To improve the recognition accuracy of skeleton movements, researchers need deep learning techniques that model the spatial-temporal nature of bone sequences [5,6]. RNN/LSTM models capture short- and long-term temporal dynamics of the bone sequence, while CNNs resize the skeleton data to a suitable input size (224 × 224) and learn the correlations.
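The graph-convolutional alternative mentioned throughout the paper treats the skeleton as a graph over joints. The sketch below is a generic ST-GCN-style spatial layer (an assumption, not the paper's exact formulation): it symmetrically normalizes a joint adjacency matrix, aggregates neighbor features per frame, and projects channels.

```python
import numpy as np

def skeleton_gcn_layer(X, A, W):
    """One spatial graph-convolution layer over skeleton joints (illustrative).

    X: (T, V, C_in) joint features over T frames and V joints.
    A: (V, V) adjacency matrix with self-loops.
    W: (C_in, C_out) learnable weight matrix.
    """
    # Symmetric normalization: D^{-1/2} A D^{-1/2}.
    D = A.sum(axis=1)
    D_inv_sqrt = np.diag(1.0 / np.sqrt(D))
    A_hat = D_inv_sqrt @ A @ D_inv_sqrt
    # Aggregate neighbour features frame by frame, project channels, ReLU.
    return np.maximum(0.0, np.einsum('vw,twc->tvc', A_hat, X) @ W)

# Toy 5-joint chain graph (spine-like), 3 frames, 2 input channels.
V = 5
A = np.eye(V)
for i in range(V - 1):
    A[i, i + 1] = A[i + 1, i] = 1.0
X = np.random.rand(3, V, 2)
W = np.random.rand(2, 4)
out = skeleton_gcn_layer(X, A, W)
```

Unlike a CNN operating on a resized pseudo-image, this layer uses the skeleton's actual connectivity, which is what allows GCN-based models to encode joint dependencies directly.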


