3D RANs: 3D Residual Attention Networks for action recognition

Jiahui Cai,Jianguo Hu

doi:10.1007/s00371-019-01733-3

Abstract

In this work, we propose 3D Residual Attention Networks (3D RANs) for action recognition, which can learn spatiotemporal representation from videos. The proposed network consists of attention mechanism and 3D ResNets architecture, and it can capture spatiotemporal information in an end-to-end manner. Specifically, we separately add the attention mechanism along channel and spatial domain to each block of 3D ResNets. For each sliced tensor of an intermediate feature map, we sequentially infer channel and spatial attention maps by channel and spatial attention mechanism submodules in each residual unit block, and the attention maps are multiplied to the input feature map to reweight the key features. We validate our network through extensive experiments in UCF-101, HMDB-51 and Kinetics datasets. Our experiments show that the proposed 3D RANs are superior to the state-of-the-art approaches for action recognition, demonstrating the effectiveness of our networks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

3D RANs: 3D Residual Attention Networks for action recognition

Abstract

Talk to us

Similar Papers

More From: The Visual Computer

Lead the way for us

Journal: The Visual Computer	Publication Date: Jul 25, 2019
Citations: 27

Similar Papers

3D Residual Networks with Channel-Spatial Attention Module for Action Recognition
Ziwen Yi ... Kebin Jia
-
Ziwen Yi, et. al.Ziwen Yi ... Kebin Jia
06 Nov 2020
06 Nov 2020

Two-Level Attention Module Based on Spurious-3D Residual Networks for Human Action Recognition.
Bo Chen ... Fangzhou Meng
Sensors (Basel, Switzerland) | VOL. 23
Bo Chen, et. al.Bo Chen ... Fangzhou Meng
03 Feb 2023
Sensors (Basel, Switzerland) | VOL. 23

Spatio-temporal-based multi-level aggregation network for physical action recognition
Yuhang Wang
Computer Science and Information Systems | VOL. 21
Yuhang WangYuhang Wang
01 Jan 2024
Computer Science and Information Systems | VOL. 21

Super-resolution reconstruction of binocular image based on multi-level fusion attention network
Lei Xu ... Huihui Song
Journal of Image and Graphics | VOL. 28
Lei Xu, et. al.Lei Xu ... Huihui Song
01 Jan 2023
Journal of Image and Graphics | VOL. 28

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

3D RANs: 3D Residual Attention Networks for action recognition

Abstract

Talk to us

Similar Papers

More From: The Visual Computer