A Novel Spatio-Temporal-Wise Network for Action Recognition

Zhengbao Cai

doi:10.1109/access.2023.3274542

Abstract

Action recognition is a challenging task that requires understanding the temporal relationships between frames. However, capturing and processing spatio-temporal and motion features is computationally expensive, making it difficult to apply to practical situations. We propose a novel approach called the Spatio-Temporal-Wise (STW) network to address this problem. The STW network inserts STW blocks, consisting of a Spatio-Temporal Fusion Module and a Temporal-Wise Module, into an existing 2D CNN. This approach requires very little additional computational overhead but brings huge performance improvements in recognizing human actions. The proposed method is evaluated on several public datasets, including Something-Something v1 & v2, Kinetics-400, UCF101, and HMDB51. STW achieved comparable or better performance on these datasets compared to state-of-the-art methods. Notably, the STW network improves recognition accuracy by 26.6% and 34.6% on the Something-Something v1 & v2 datasets, respectively, with less than 2% additional computational overhead. The results demonstrate that the STW network can significantly improve performance in action recognition tasks while requiring only a small additional computational overhead, which represents a promising direction for developing more efficient and effective approaches to handling temporal reasoning in action recognition, which may have important applications in the future.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Novel Spatio-Temporal-Wise Network for Action Recognition

Abstract

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Journal: IEEE Access	Publication Date: Jan 1, 2023
License type: CC BY-NC-ND 4.0

Similar Papers

METIER
Ling Chen ... Yi Zhang
Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies | VOL. 4
Ling Chen, et. al.Ling Chen ... Yi Zhang
18 Mar 2020
Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies | VOL. 4

Spatio-temporal adaptive convolution and bidirectional motion difference fusion for video action recognition
Linxi Li ... Mingfeng Zhao
Expert Systems With Applications | VOL. 255
Linxi Li, et. al.Linxi Li ... Mingfeng Zhao
27 Jul 2024
Expert Systems With Applications | VOL. 255

Continuous Multi-View Human Action Recognition
Qiang Wang ... Jiahua Dong
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 32
Qiang Wang, et. al.Qiang Wang ... Jiahua Dong
01 Jun 2022
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 32

Frame-skip Convolutional Neural Networks for action recognition
Yinan Liu ... Liangzhi Tang
-
Yinan Liu, et. al. Yinan Liu ... Liangzhi Tang
01 Jul 2017
01 Jul 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Novel Spatio-Temporal-Wise Network for Action Recognition

Abstract

Talk to us

Similar Papers

More From: IEEE Access