Spatiotemporal attention enhanced features fusion network for action recognition

Danfeng Zhuang,Jun Kong,Tianshan Liu,Min Jiang

doi:10.1007/s13042-020-01204-5

Abstract

In recent years, action recognition has become a popular and challenging task in computer vision. Nowadays, two-stream networks with appearance stream and motion stream can make judgment jointly and get excellent action classification results. But many of these networks fused the features or scores simply, and the characteristics in different streams were not utilized effectively. Meanwhile, the spatial context and temporal information were not fully utilized and processed in some networks. In this paper, a novel three-stream network spatiotemporal attention enhanced features fusion network for action recognition is proposed. Firstly, features fusion stream which includes multi-level features fusion blocks, is designed to train the two streams jointly and complement the two-stream network. Secondly, we model the channel features obtained by spatial context to enhance the ability to extract useful spatial semantic features at different levels. Thirdly, a temporal attention module which can model the temporal information makes the extracted temporal features more representative. A large number of experiments are performed on UCF101 dataset and HMDB51 dataset, which verify the effectiveness of our proposed network for action recognition.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Spatiotemporal attention enhanced features fusion network for action recognition

Abstract

Talk to us

Similar Papers

More From: International Journal of Machine Learning and Cybernetics

Lead the way for us

Journal: International Journal of Machine Learning and Cybernetics	Publication Date: Oct 12, 2020
Citations: 11

Similar Papers

Spatial-temporal interaction learning based two-stream network for action recognition
Tianyu Liu ... Ping Jiang
Information Sciences | VOL. 606
Tianyu Liu, et. al.Tianyu Liu ... Ping Jiang
28 May 2022
Information Sciences | VOL. 606

An End to End Framework With Adaptive Spatio-Temporal Attention Module for Human Action Recognition
Shaocan Liu ... Hanbo Wu
IEEE Access | VOL. 8
Shaocan Liu, et. al.Shaocan Liu ... Hanbo Wu
01 Jan 2020
IEEE Access | VOL. 8

Learning Video Actions in Two Stream Recurrent Neural Network
Ehtesham Hassan
Pattern Recognition Letters | VOL. 151
Ehtesham HassanEhtesham Hassan
01 Nov 2021
Pattern Recognition Letters | VOL. 151

Multi-stream with Deep Convolutional Neural Networks for Human Action Recognition in Videos
Xiao Liu ... Xudong Yang
-
Xiao Liu, et. al.Xiao Liu ... Xudong Yang
01 Jan 2018
01 Jan 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Spatiotemporal attention enhanced features fusion network for action recognition

Abstract

Talk to us

Similar Papers

More From: International Journal of Machine Learning and Cybernetics