Few-shot activity recognition with cross-modal memory network

Lingling Zhang,Xiaojun Chang,Jun Liu,Minnan Luo,Mahesh Prakash,Alexander G Hauptmann

doi:10.1016/j.patcog.2020.107348

Abstract

Deep learning based action recognition methods require large amount of labelled training data. However, labelling large-scale video data is time consuming and tedious. In this paper, we consider a more challenging few-shot action recognition problem where the training samples are few and rare. To solve this problem, memory network has been designed to use an external memory to remember the experience learned in training and then apply it to few-shot prediction during testing. However, existing memory-based methods just update the visual information with fixed label embeddings in the memory, which cannot adapt well to novel activities during testing. To alleviate the issue, we propose a novel end-to-end cross-modal memory network for few-shot activity recognition. Specifically, the proposed memory architecture stores the dynamic visual and textual semantics for some high-level attributes related to human activities. And the learned memory can provide effective multi-modal information for new activity recognition in the testing stage. Extensive experimental results on two video datasets, including HMDB51 and UCF101, indicate that our method could achieve significant improvements over other previous methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Pattern Recognition	Publication Date: Jul 30, 2020
Citations: 37	License type: publisher-specific-oa

R Discovery Prime

R Discovery Prime

Few-shot activity recognition with cross-modal memory network

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition

Lead the way for us

Similar Papers

Attention-Aware Sampling via Deep Reinforcement Learning for Action Recognition
Wenkai Dong ... Tieniu Tan
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 33
Wenkai Dong, et. al.Wenkai Dong ... Tieniu Tan
17 Jul 2019
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 33

Identifying the key frames: An attention-aware sampling method for action recognition
Wenkai Dong ... Tieniu Tan
Pattern Recognition | VOL. 130
Wenkai Dong, et. al.Wenkai Dong ... Tieniu Tan
19 May 2022
Pattern Recognition | VOL. 130

An Online Approach for Gesture Recognition Toward Real-World Applications
Zhaoxuan Fan ... Wanli Jiang
-
Zhaoxuan Fan, et. al.Zhaoxuan Fan ... Wanli Jiang
01 Jan 2017
01 Jan 2017

Research on deep learning-based action recognition and quantitative assessment method for sports skills
Tao Wang
Applied Mathematics and Nonlinear Sciences | VOL. 9
Tao WangTao Wang
01 Jan 2024
Applied Mathematics and Nonlinear Sciences | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Few-shot activity recognition with cross-modal memory network

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition