Channel-Wise Spatial Attention with Spatiotemporal Heterogeneous Framework for Action Recognition

Yiying Li,Yanfei Gu,Yulin Li

doi:10.1145/3404555.3404592

Abstract

Recent years have witnessed the effective of attention network based on two-stream for video action recognition. However, most methods adopt the same structure on spatial stream and temporal stream, which produce amount redundant information and often ignore the relevance among channels. In this paper, we propose a channel-wise spatial attention with spatiotemporal heterogeneous framework, a new approach to action recognition. First, we employ two different network structures for spatial stream and temporal stream to improve the performance of action recognition. Then, we design a channel-wise network and spatial network inspired by self-attention mechanism to obtain the fine-grained and salient information of the video. Finally, the feature of video for action recognition is generated by end-to-end training. Experimental results on the datasets HMDB51 and UCF101 shows our method can effectively recognize the actions in the video.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Channel-Wise Spatial Attention with Spatiotemporal Heterogeneous Framework for Action Recognition

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Learning Video Actions in Two Stream Recurrent Neural Network
Ehtesham Hassan
Pattern Recognition Letters | VOL. 151
Ehtesham HassanEhtesham Hassan
01 Nov 2021
Pattern Recognition Letters | VOL. 151

Spatiotemporal Saliency Based Multi-stream Networks for Action Recognition
Zhenbing Liu ... Ruili Wang
-
Zhenbing Liu, et. al.Zhenbing Liu ... Ruili Wang
01 Jan 2020
01 Jan 2020

Human Action Recognition Based on Improved Two-Stream Convolution Network
Zhongwen Wang ... Junlan Jin
Applied Sciences | VOL. 12
Zhongwen Wang, et. al.Zhongwen Wang ... Junlan Jin
07 Jun 2022
Applied Sciences | VOL. 12

Multi-head attention-based two-stream EfficientNet for action recognition
Aihua Zhou ... Yujun Ma
Multimedia Systems | VOL. 29
Aihua Zhou, et. al.Aihua Zhou ... Yujun Ma
24 Jun 2022
Multimedia Systems | VOL. 29

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Channel-Wise Spatial Attention with Spatiotemporal Heterogeneous Framework for Action Recognition

Abstract

Talk to us

Similar Papers