Abstract
For CNN-based visual action recognition, accuracy can be improved by focusing on the local regions where the key action occurs. Self-attention aims to focus on key features and suppress irrelevant information, which makes it well suited to action recognition. However, current self-attention methods usually ignore correlations among the local feature vectors at spatial positions in CNN feature maps. In this paper, we propose an effective interaction-aware self-attention model that exploits the interactions between feature vectors to learn attention maps. Since different layers of a network capture feature maps at different scales, we build a spatial pyramid from the feature maps of multiple layers for attention modeling; this multi-scale information yields more accurate attention scores. The attention scores are used to weight the local feature vectors of the feature maps, producing attentional feature maps. Because the number of feature maps fed into the spatial pyramid attention layer is unrestricted, the layer extends naturally to a spatio-temporal version. Our model can be embedded in any general CNN to form a video-level, end-to-end attention network for action recognition. We also investigate several methods of combining the RGB and flow streams to obtain accurate action predictions. Experimental results show that our method achieves state-of-the-art results on the UCF101, HMDB51, Kinetics-400, and untrimmed Charades datasets.
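To make the mechanism concrete, the following is a minimal PyTorch sketch of a spatial pyramid attention layer in the spirit described above. It is not the authors' implementation: the class name, the pyramid grid size, and the query/key projections used to compute pairwise interactions are all assumptions made for illustration.

```python
# Hypothetical sketch of an interaction-aware spatial pyramid attention layer.
# Shapes, names, and the query/key interaction formulation are assumptions,
# not the authors' exact method.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SpatialPyramidAttention(nn.Module):
    """Resizes CNN feature maps from several layers to one common grid,
    scores each spatial position by its interactions with all other
    positions, and returns an attention-weighted feature map."""

    def __init__(self, channels: int, pyramid_size: int = 7):
        super().__init__()
        self.pyramid_size = pyramid_size
        # 1x1 projections used to compute pairwise interactions
        self.query = nn.Conv2d(channels, channels // 8, kernel_size=1)
        self.key = nn.Conv2d(channels, channels // 8, kernel_size=1)

    def forward(self, feature_maps):
        # feature_maps: list of (B, C, H_i, W_i) tensors from different layers;
        # the list length is unrestricted, so stacking frames along it gives
        # a spatio-temporal variant.
        s = self.pyramid_size
        # Pool every level to the same grid and sum, so multi-scale
        # context contributes to a single set of attention scores.
        pooled = sum(F.adaptive_avg_pool2d(f, s) for f in feature_maps)
        B, C, H, W = pooled.shape
        q = self.query(pooled).flatten(2)               # (B, C', H*W)
        k = self.key(pooled).flatten(2)                 # (B, C', H*W)
        # Interaction matrix between all pairs of local feature vectors.
        interactions = torch.bmm(q.transpose(1, 2), k)  # (B, H*W, H*W)
        # Attention score of a position: softmax over its summed interactions.
        scores = F.softmax(interactions.sum(dim=-1), dim=-1)  # (B, H*W)
        v = pooled.flatten(2)                           # (B, C, H*W)
        # Weight local feature vectors by their attention scores.
        attended = v * scores.unsqueeze(1)              # (B, C, H*W)
        return attended.view(B, C, H, W)

# Example usage with feature maps from three hypothetical backbone stages:
# layer = SpatialPyramidAttention(channels=512)
# out = layer([feat_conv3, feat_conv4, feat_conv5])  # (B, 512, 7, 7)
```

The key point the sketch captures is that attention scores come from pairwise interactions among local feature vectors rather than from each position in isolation, and that feature maps from multiple scales feed the same attention computation.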