Abstract

Efficient spatio-temporal modeling is one of the key research problems in action recognition. Previous approaches enhance backbone features individually through hierarchical structures, but most fail to adequately balance the interaction of features within those structures. In this work, we propose an effective Multi-dimensional Adaptive Fusion Network (MDAF-Net), which can be embedded into mainstream action recognition backbones in a plug-and-play manner to fully activate the transfer and representation of action features in deep networks. Specifically, MDAF-Net contains two main components: the Adaptive Temporal Capture Module (ATCM) and the Extended Spatial and Channel Module (ESCM). The ATCM suppresses the over-expression of similar features in adjacent frames and activates the expression of motion-flow information. The ESCM further improves temporal modeling efficiency by extending the spatial receptive field and enhancing channel attention. Extensive experiments on several challenging action recognition benchmarks, including Something-Something V1 & V2 and Kinetics-400, demonstrate that MDAF-Net achieves state-of-the-art or competitive performance.
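To make the plug-and-play structure concrete, the sketch below shows one plausible way the two components could wrap a backbone feature map in PyTorch. The module names ATCM and ESCM come from the abstract, but their internals here (temporal frame differencing with a learned gate, a dilated spatial convolution, and squeeze-and-excitation-style channel attention) are assumptions chosen to match each module's stated goal, not the authors' confirmed design.

```python
import torch
import torch.nn as nn


class ATCMSketch(nn.Module):
    """Hypothetical ATCM: gate features with adjacent-frame differences,
    down-weighting near-duplicate content and emphasizing motion cues."""

    def __init__(self, channels):
        super().__init__()
        self.gate = nn.Sequential(
            nn.Conv3d(channels, channels, kernel_size=1),
            nn.Sigmoid(),
        )

    def forward(self, x):  # x: (N, C, T, H, W)
        # Difference between adjacent frames, zero-padded at the last step.
        diff = torch.zeros_like(x)
        diff[:, :, :-1] = x[:, :, 1:] - x[:, :, :-1]
        # Motion-derived attention plus a residual connection.
        return x * self.gate(diff) + x


class ESCMSketch(nn.Module):
    """Hypothetical ESCM: dilated spatial convolution (larger receptive
    field) followed by squeeze-and-excitation channel attention."""

    def __init__(self, channels, reduction=16):
        super().__init__()
        # Spatial-only 3x3 kernel with dilation 2 keeps H and W unchanged.
        self.spatial = nn.Conv3d(
            channels, channels, kernel_size=(1, 3, 3),
            padding=(0, 2, 2), dilation=(1, 2, 2),
        )
        self.channel = nn.Sequential(
            nn.AdaptiveAvgPool3d(1),
            nn.Conv3d(channels, channels // reduction, kernel_size=1),
            nn.ReLU(inplace=True),
            nn.Conv3d(channels // reduction, channels, kernel_size=1),
            nn.Sigmoid(),
        )

    def forward(self, x):
        s = self.spatial(x)
        return s * self.channel(s) + x


class MDAFBlock(nn.Module):
    """Plug-and-play wrapper applying ATCM then ESCM; how the two are
    fused in MDAF-Net itself is not specified by the abstract."""

    def __init__(self, channels):
        super().__init__()
        self.atcm = ATCMSketch(channels)
        self.escm = ESCMSketch(channels)

    def forward(self, x):
        return self.escm(self.atcm(x))


# Usage: insert after a backbone stage that outputs a 5-D video tensor.
feat = torch.randn(2, 64, 8, 56, 56)  # (batch, channels, frames, H, W)
out = MDAFBlock(64)(feat)
print(out.shape)  # torch.Size([2, 64, 8, 56, 56])
```

Because the block preserves the input shape, it can in principle be dropped between any two stages of a 3D-CNN backbone, which is consistent with the abstract's plug-and-play claim.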
