Abstract
Video action recognition automatically determines the category of the action performed in a video, and efficient recognition algorithms are needed to predict video labels. This work proposes a video action recognition model based on dual-stream information fusion with attention mechanisms (DSIFAM), which consists of three sub-modules. First, it proposes an improved keyframe extraction method (IKFE): building on K-means clustering, it measures the similarity between video frames with convolutional features rather than raw pixels, and after obtaining preliminary clustering results it performs a secondary optimization to select more representative keyframes. Second, it proposes a video action recognition model based on dual-stream information fusion (DSIF), which introduces ConvLSTM in the spatial stream and replaces the original convolutional network in the temporal stream with P3D, extracting spatial-temporal information more effectively and improving classification performance. Third, it designs a multi-scale attention mechanism (MSAM) that enhances the feature extraction stage and yields higher-quality classification features with stronger representation capability. Finally, systematic experiments on different datasets verify the superiority of DSIFAM for video action recognition.
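The abstract does not give the full IKFE pipeline, but its core idea, clustering convolutional frame features with K-means and keeping one representative frame per cluster, can be sketched as below. This is a minimal illustration, not the paper's implementation: the ResNet-18 backbone and the helper names `extract_features` and `select_keyframes` are assumptions, and the secondary optimization step is omitted.

```python
# Sketch of the IKFE idea: cluster per-frame convolutional features with
# K-means, then keep the frame nearest each centroid as a keyframe.
# The ResNet-18 backbone and all names here are illustrative assumptions,
# not the paper's exact method.
import numpy as np
import torch
from sklearn.cluster import KMeans
from torchvision.models import resnet18

def extract_features(frames: torch.Tensor) -> np.ndarray:
    """Map frames (N, 3, 224, 224) to convolutional features (N, 512)."""
    backbone = resnet18(weights="DEFAULT")
    backbone.fc = torch.nn.Identity()  # drop the classifier head
    backbone.eval()
    with torch.no_grad():
        return backbone(frames).numpy()

def select_keyframes(frames: torch.Tensor, k: int = 8) -> list[int]:
    feats = extract_features(frames)
    km = KMeans(n_clusters=k, n_init=10).fit(feats)
    keyframes = []
    for c in range(k):  # one representative frame per cluster
        members = np.where(km.labels_ == c)[0]
        dists = np.linalg.norm(feats[members] - km.cluster_centers_[c], axis=1)
        keyframes.append(int(members[dists.argmin()]))
    return sorted(keyframes)
```

Comparing frames in feature space rather than pixel space makes the clustering robust to small camera motion and lighting changes, which is the motivation the abstract gives for replacing pixel-level similarity.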
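For the spatial stream, the abstract only states that ConvLSTM is introduced. As context, a standard ConvLSTM cell replaces the matrix multiplications of an LSTM with convolutions, so hidden states retain their spatial layout while temporal dependencies are modeled. The sketch below is a generic cell under that standard formulation, not the paper's exact configuration.

```python
# A generic ConvLSTM cell: LSTM gating with convolutions instead of
# matrix multiplications, so hidden and cell states stay spatial maps.
import torch
import torch.nn as nn

class ConvLSTMCell(nn.Module):
    def __init__(self, in_ch: int, hid_ch: int, kernel: int = 3):
        super().__init__()
        # one convolution produces all four gate pre-activations
        self.gates = nn.Conv2d(in_ch + hid_ch, 4 * hid_ch, kernel,
                               padding=kernel // 2)

    def forward(self, x, state):
        # x: (batch, in_ch, H, W); state: (h, c), each (batch, hid_ch, H, W)
        h, c = state
        i, f, o, g = self.gates(torch.cat([x, h], dim=1)).chunk(4, dim=1)
        i, f, o = torch.sigmoid(i), torch.sigmoid(f), torch.sigmoid(o)
        c = f * c + i * torch.tanh(g)  # update cell state
        h = o * torch.tanh(c)          # emit new hidden state
        return h, c
```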
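In the temporal stream, P3D (pseudo-3D convolution) factorizes a 3x3x3 convolution into a 1x3x3 spatial convolution and a 3x1x1 temporal convolution, which cuts parameters relative to full 3D convolution. A minimal block illustrating that factorization follows; the channel sizes and the serial (P3D-A-style) arrangement are assumptions, since the abstract does not specify which variant is used.

```python
# A P3D-style block: a 3x3x3 convolution factorized into a 1x3x3 spatial
# convolution followed by a 3x1x1 temporal convolution (P3D-A-style
# serial arrangement, assumed here for illustration).
import torch
import torch.nn as nn

class P3DBlock(nn.Module):
    def __init__(self, in_ch: int, out_ch: int):
        super().__init__()
        self.spatial = nn.Conv3d(in_ch, out_ch, kernel_size=(1, 3, 3),
                                 padding=(0, 1, 1))   # 2D conv over H, W
        self.temporal = nn.Conv3d(out_ch, out_ch, kernel_size=(3, 1, 1),
                                  padding=(1, 0, 0))  # 1D conv over time
        self.bn = nn.BatchNorm3d(out_ch)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, channels, time, height, width)
        return self.relu(self.bn(self.temporal(self.spatial(x))))

# Usage: two clips of 8 frames at 112x112 resolution
clip = torch.randn(2, 3, 8, 112, 112)
print(P3DBlock(3, 64)(clip).shape)  # torch.Size([2, 64, 8, 112, 112])
```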
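The abstract does not describe how MSAM is constructed, so the following is only one plausible reading of "multi-scale attention": pool the feature map at several spatial scales, score each pooled view with a shared MLP, and reweight the channels with the averaged scores. All names, the choice of scales, and the channel-attention formulation are assumptions for illustration.

```python
# One plausible multi-scale channel attention (an assumption, not the
# paper's MSAM): pool at several scales, score with a shared MLP, and
# reweight channels with the averaged sigmoid scores.
import torch
import torch.nn as nn

class MultiScaleChannelAttention(nn.Module):
    def __init__(self, channels: int, scales=(1, 2, 4), reduction: int = 8):
        super().__init__()
        self.scales = scales
        self.mlp = nn.Sequential(  # shared scoring MLP across scales
            nn.Linear(channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, channels, height, width)
        scores = 0
        for s in self.scales:  # pool the feature map at each scale
            pooled = nn.functional.adaptive_avg_pool2d(x, s)  # (B, C, s, s)
            pooled = pooled.flatten(2).mean(dim=2)            # (B, C)
            scores = scores + self.mlp(pooled)
        weights = torch.sigmoid(scores / len(self.scales))    # (B, C)
        return x * weights[:, :, None, None]  # channel reweighting
```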