Abstract

Contrastive learning has been successfully leveraged to learn action representations for semi-supervised skeleton-based action recognition. However, most contrastive learning-based methods only contrast global features that mix spatiotemporal information, which conflates the spatial- and temporal-specific information carrying different semantics at the frame level and the joint level. Thus, we propose a novel spatiotemporal decouple-and-squeeze contrastive learning (SDS-CL) framework to learn richer representations of skeleton-based actions by jointly contrasting spatial-squeezing features, temporal-squeezing features, and global features. In SDS-CL, we design a new spatiotemporal-decoupling intra-inter attention (SIIA) mechanism that obtains spatiotemporal-decoupling attentive features capturing spatiotemporal-specific information, by calculating spatial- and temporal-decoupling intra-attention maps among joint/motion features, as well as spatial- and temporal-decoupling inter-attention maps between joint and motion features. Moreover, we present a new spatial-squeezing temporal-contrasting loss (STL), a new temporal-squeezing spatial-contrasting loss (TSL), and a global-contrasting loss (GL) to contrast the spatial-squeezing joint and motion features at the frame level, the temporal-squeezing joint and motion features at the joint level, and the global joint and motion features at the skeleton level. Extensive experimental results on four public datasets show that the proposed SDS-CL achieves performance gains over other competitive methods.
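To make the three contrastive objectives concrete, the sketch below gives one plausible reading of STL, TSL, and GL: the spatial and temporal axes of the attentive joint- and motion-stream features are squeezed by average pooling, and matched frame-level, joint-level, and skeleton-level features are contrasted with an InfoNCE-style loss. The pooling choice, the loss form, the equal weighting of the three terms, and the function names are illustrative assumptions, not the authors' exact implementation.

```python
import torch
import torch.nn.functional as F


def info_nce(a, b, temperature=0.1):
    """Symmetric InfoNCE over matched rows of a and b, both (N, D).
    Same-index pairs are positives; all other rows act as negatives."""
    a = F.normalize(a, dim=-1)
    b = F.normalize(b, dim=-1)
    logits = a @ b.t() / temperature                      # (N, N) similarities
    targets = torch.arange(a.size(0), device=a.device)
    return 0.5 * (F.cross_entropy(logits, targets) +
                  F.cross_entropy(logits.t(), targets))


def sds_cl_losses(joint_feat, motion_feat, temperature=0.1):
    """joint_feat, motion_feat: (B, T, V, D) attentive features from the joint
    and motion streams (B = batch, T = frames, V = joints, D = channels)."""
    B, T, V, D = joint_feat.shape

    # STL: squeeze the spatial (joint) axis, contrast frame-level features.
    stl = info_nce(joint_feat.mean(dim=2).reshape(B * T, D),
                   motion_feat.mean(dim=2).reshape(B * T, D), temperature)

    # TSL: squeeze the temporal (frame) axis, contrast joint-level features.
    tsl = info_nce(joint_feat.mean(dim=1).reshape(B * V, D),
                   motion_feat.mean(dim=1).reshape(B * V, D), temperature)

    # GL: squeeze both axes, contrast skeleton-level global features.
    gl = info_nce(joint_feat.mean(dim=(1, 2)),
                  motion_feat.mean(dim=(1, 2)), temperature)

    return stl + tsl + gl
```

In this reading, each squeezing step discards one axis so that the remaining axis supplies many positive/negative pairs per sample, which is what lets the frame-level and joint-level contrasts expose temporal- and spatial-specific structure that a single global contrast would average away.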
