Multi-views reinforced LSTM for video-based action recognition

Zhenzhen Mao,Jun Kong,Min Jiang

doi:10.1117/1.jei.30.5.053021

Abstract

Recently, the long short-term memory network (LSTM) and attention mechanism have greatly boosted the research of video-based action recognition. For this task, feature extraction especially temporal feature extraction is essential. However, most studies focus on improving the temporal feature extraction ability of the model, ignoring the lack of temporal information in the input. To alleviate the issue above, we propose multi-views reinforced LSTM (MR-LSTM). First, we propose an innovative feature extractor named multi-views temporal feature extractor (MTFE) to extract multi-views temporal features from RGB frames in different views. Secondly, we propose multi-views reinforced attention (MRA) mechanism, which utilizes multi-views features to enrich the temporal information in the input of LSTM. MTFE and MRA mechanisms alleviate the lack of temporal information in the input of LSTM. Equipped with the modules above, LSTM can extract more discriminative temporal features. Finally, we propose non-fair fusion strategy to obtain more discriminative fusion features that are beneficial for classification. The ablation experiment demonstrates the effectiveness of all proposed modules. In comprehensive experiments on UCF101 and HMDB51 datasets, our architecture performs competitively against state-of-the-art methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multi-views reinforced LSTM for video-based action recognition

Abstract

Talk to us

Similar Papers

More From: Journal of Electronic Imaging

Lead the way for us

Journal: Journal of Electronic Imaging	Publication Date: Oct 9, 2021
Citations: 2

Similar Papers

Extracting Temporal Features by Key Points Transfer for Effective Action Recognition
Chenxi Liao ... Yuecong Xu
-
Chenxi Liao, et. al.Chenxi Liao ... Yuecong Xu
13 Dec 2020
13 Dec 2020

Effective action recognition with embedded key point shifts
Haozhi Cao ... Simon See
Pattern Recognition | VOL. 120
Haozhi Cao, et. al.Haozhi Cao ... Simon See
18 Jul 2021
Pattern Recognition | VOL. 120

Gait feature learning via spatio-temporal two-branch networks
Yifan Chen ... Xuelong Li
Pattern Recognition | VOL. 147
Yifan Chen, et. al.Yifan Chen ... Xuelong Li
07 Nov 2023
Pattern Recognition | VOL. 147

ALTASN: A few-shot learning fault diagnosis method for rotating machinery of unmanned surface vehicles based on attention mechanism
Yu Cao ... Mohammed Abdulaal
Transactions of the Institute of Measurement and Control | VOL. -
Yu Cao, et. al.Yu Cao ... Mohammed Abdulaal
15 Apr 2024
Transactions of the Institute of Measurement and Control | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-views reinforced LSTM for video-based action recognition

Abstract

Talk to us

Similar Papers

More From: Journal of Electronic Imaging