Abstract
In the digital age, with the continuous emergence of large-scale video data, video understanding has become increasingly important. As a core domain, action recognition has garnered widespread attention. However, video exhibits high-dimensional properties and contains human action information at multiple scales, which makes it difficult for conventional attention mechanisms to capture complex action information. To improve the performance of action recognition, a Hybrid Attention-guided ConvNeXt-GRU Network (HACG) is proposed. Specifically, a Novel Attention Mechanism (ANM) is constructed by integrating a parameter-free attention module into ConvNeXt, enabling the preliminary extraction of important features without adding extra parameters. Then, a Multiscale Hybrid Attention Module (MHAM) adopts an improved, efficient Selective Kernel Network (SKNet) to adaptively calibrate channel features. In this way, the module enhances the model's ability to perceive features at different scales while strengthening the correlations between channels. Furthermore, MHAM incorporates an Atrous Spatial Pyramid Pooling (ASPP) module to extract local and global information from different regions. Finally, MHAM is integrated with a Gated Recurrent Unit (GRU) to capture the interdependence between space and time. Experimental results show that HACG is highly competitive with state-of-the-art methods on the UCF-101, HMDB-51, and Kinetics-400 datasets. This indicates that HACG can more effectively capture important features and suppress noise interference while maintaining a lower computational load, making HACG a highly promising choice for action recognition tasks.
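The abstract does not specify which parameter-free attention module is integrated into ConvNeXt; one common choice for this role is a SimAM-style energy-based attention, which weights each activation by how much it deviates from its channel's spatial mean, using no learnable parameters. A minimal NumPy sketch under that assumption (the function name and `lam` regularizer are illustrative, not from the paper):

```python
import numpy as np

def parameter_free_attention(x, lam=1e-4):
    """SimAM-style parameter-free attention over a feature map.

    x: feature map of shape (C, H, W).
    Returns x rescaled by per-position weights in (0, 1); no weights
    are learned, so the module adds zero parameters to the backbone.
    """
    c, h, w = x.shape
    n = h * w - 1
    # Squared deviation of each position from its channel's spatial mean.
    mu = x.mean(axis=(1, 2), keepdims=True)
    d = (x - mu) ** 2
    # Channel-wise variance-like energy term.
    v = d.sum(axis=(1, 2), keepdims=True) / n
    # Higher deviation -> higher saliency; squash to (0, 1) with a sigmoid.
    e = d / (4.0 * (v + lam)) + 0.5
    weights = 1.0 / (1.0 + np.exp(-e))
    return x * weights

rng = np.random.default_rng(0)
feat = rng.standard_normal((8, 14, 14))   # toy (C, H, W) feature map
out = parameter_free_attention(feat)
```

Because the sigmoid output lies strictly in (0, 1), the module can only attenuate activations, emphasizing salient positions relative to the rest of the map rather than amplifying them.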
Published in: Engineering Applications of Artificial Intelligence