Attention-guided mask learning for self-supervised 3D action recognition

Haoyuan Zhang

doi:10.1007/s40747-024-01558-1

Open Access

Journal: Complex & Intelligent Systems
Publication Date: Jul 19, 2024
License type: CC BY 4.0

Abstract

Most existing 3D action recognition methods rely on supervised learning, yet the scarcity of annotated data keeps encoding networks from reaching their full potential. As a result, effective self-supervised pre-training strategies have been actively researched. In this paper, we explore a self-supervised learning approach for 3D action recognition and propose the Attention-guided Mask Learning (AML) scheme. Specifically, a dropping mechanism is introduced into contrastive learning, yielding an Attention-guided Mask (AM) module and a mask learning strategy. The AM module uses spatial and temporal attention to guide the masking of the corresponding features, producing the masked contrastive object. The mask learning strategy trains the model to discriminate between actions even when important features are masked, which makes the learned action representations more discriminative. In addition, to relax the strict positive constraint that would otherwise hinder representation learning, a positive-enhanced learning strategy is applied in the second training stage. Extensive experiments on the NTU-60, NTU-120, and PKU-MMD datasets show that the proposed AML scheme improves performance in self-supervised 3D action recognition, achieving state-of-the-art results.
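
The idea of masking the most-attended features and then contrasting the masked view against other samples can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not specify how attention is computed, how much is masked, or which contrastive loss is used. The function names, the top-k masking rule, the `drop_ratio` parameter, and the use of a standard InfoNCE loss are all assumptions made for illustration.

```python
import numpy as np

def attention_guided_mask(features, attention, drop_ratio=0.2):
    """Zero out the positions with the highest attention scores.

    features:   array of shape (N, D), one D-dim feature per joint or frame.
    attention:  array of shape (N,), an importance score per position.
    drop_ratio: fraction of positions to mask (hypothetical hyperparameter).
    """
    n = features.shape[0]
    k = max(1, int(round(drop_ratio * n)))
    # Indices of the k most-attended positions (assumed masking rule).
    top = np.argsort(attention)[-k:]
    mask = np.ones(n, dtype=features.dtype)
    mask[top] = 0
    return features * mask[:, None], mask

def info_nce(query, positive, negatives, temperature=0.07):
    """Standard InfoNCE loss for one query, one positive, and M negatives."""
    def normalize(x):
        return x / np.linalg.norm(x, axis=-1, keepdims=True)
    q, p, neg = normalize(query), normalize(positive), normalize(negatives)
    # Positive similarity first, then the M negative similarities.
    logits = np.concatenate([[q @ p], neg @ q]) / temperature
    logits = logits - logits.max()  # numerical stability
    return -(logits[0] - np.log(np.exp(logits).sum()))

# Toy example: 4 positions with 3-dim features; position 1 is most attended.
features = np.arange(12, dtype=float).reshape(4, 3)
attention = np.array([0.1, 0.7, 0.15, 0.05])
masked, mask = attention_guided_mask(features, attention, drop_ratio=0.25)

# Mean-pool the masked view into a query embedding and contrast it with
# the unmasked view (positive) and one unrelated sample (negative).
query = masked.mean(axis=0)
positive = features.mean(axis=0)
negatives = np.array([[1.0, -1.0, 0.0]])
loss = info_nce(query, positive, negatives)
```

In this sketch the masked view serves as the query, so the loss stays low only if the embedding remains close to its positive after the most-attended position has been removed, which is the intuition the abstract gives for the mask learning strategy.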

Similar Papers

Adversarial Self-supervised Learning for Semi-supervised 3D Action Recognition
Chenyang Si ... Liang Wang
01 Jan 2020

SpATr: MoCap 3D human action recognition based on spiral auto-encoder and transformer network
Hamza Bouzid ... Lahoucine Ballihi
Computer Vision and Image Understanding | VOL. 241
17 Feb 2024

A Unified Deep Framework for Joint 3D Pose Estimation and Action Recognition from a Single RGB Camera
Huy Hieu Pham ... Alain Crouzil
Sensors | VOL. 20
25 Mar 2020

SkeleMotion: A New Representation of Skeleton Joint Sequences based on Motion Information for 3D Action Recognition
Carlos Caetano ... François Bremond
01 Sep 2019
