Abstract

Video-based action recognition is a challenging task that demands careful consideration of the temporal properties of videos in addition to their appearance attributes. In particular, the temporal domain of raw videos usually contains significantly more redundant or irrelevant information than still images. To address this, this paper proposes an unsupervised video-based action recognition approach based on imagining motion and perceiving appearance, called IMPA, which comprehensively learns the spatio-temporal characteristics inherent in videos, with a particular emphasis on the moving object. Specifically, a self-supervised Motion Extracting Block (MEB) is designed to extract the principal motion features by focusing on the large movements of the moving object, based on the observation that humans can infer complete motion trajectories from partially observed moving objects. To further account for the indispensable appearance attributes of videos, an unsupervised Appearance Learning Block (ALB) is developed to perceive static appearance, which is combined with the MEB to recognize actions. Extensive validation experiments and ablation studies on multiple datasets demonstrate that the proposed IMPA approach achieves superior performance and surpasses both classical and state-of-the-art unsupervised action recognition methods.
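To make the two-block design concrete, below is a minimal PyTorch sketch of how a motion branch (MEB) and an appearance branch (ALB) might be combined for recognition. Only the block names come from the abstract; the layer choices, the frame-difference motion cue, the middle-frame appearance cue, and the concatenation fusion are illustrative assumptions, and the self-supervised and unsupervised training objectives are not shown.

```python
# Minimal sketch of a two-branch IMPA-style model. The block names MEB and
# ALB come from the abstract; their internal layers, feature dimensions, and
# the fusion scheme below are illustrative assumptions, not the paper's design.
import torch
import torch.nn as nn


class MotionExtractingBlock(nn.Module):
    """Hypothetical MEB: encodes motion features from frame differences."""
    def __init__(self, in_channels=3, feat_dim=128):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv3d(in_channels, 64, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.AdaptiveAvgPool3d(1),
            nn.Flatten(),
            nn.Linear(64, feat_dim),
        )

    def forward(self, clip):  # clip: (B, C, T, H, W)
        # Frame differences emphasize large movements of the moving object.
        motion = clip[:, :, 1:] - clip[:, :, :-1]
        return self.encoder(motion)


class AppearanceLearningBlock(nn.Module):
    """Hypothetical ALB: encodes static appearance from a single frame."""
    def __init__(self, in_channels=3, feat_dim=128):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv2d(in_channels, 64, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.AdaptiveAvgPool2d(1),
            nn.Flatten(),
            nn.Linear(64, feat_dim),
        )

    def forward(self, clip):
        # Use the middle frame as a static appearance cue (an assumption).
        frame = clip[:, :, clip.shape[2] // 2]
        return self.encoder(frame)


class IMPA(nn.Module):
    """Combines motion (MEB) and appearance (ALB) features for recognition."""
    def __init__(self, num_classes=101, feat_dim=128):
        super().__init__()
        self.meb = MotionExtractingBlock(feat_dim=feat_dim)
        self.alb = AppearanceLearningBlock(feat_dim=feat_dim)
        self.classifier = nn.Linear(2 * feat_dim, num_classes)

    def forward(self, clip):
        fused = torch.cat([self.meb(clip), self.alb(clip)], dim=1)
        return self.classifier(fused)


# Usage: a batch of 8 clips, 3 channels, 16 frames, 112x112 resolution.
model = IMPA()
logits = model(torch.randn(8, 3, 16, 112, 112))
print(logits.shape)  # torch.Size([8, 101])
```

In an unsupervised setting such as the one the abstract describes, the classifier head would typically be trained separately (e.g., a linear probe) on top of representations learned without action labels.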
