OTM-HC: Enhanced Skeleton-Based Action Representation via One-to-Many Hierarchical Contrastive Learning

Muhammad Usman, Wenming Cao, Zhao Huang, Jianqi Zhong, Ruiya Ji

doi:10.3390/ai5040106

Abstract

Human action recognition has become crucial in computer vision, with growing applications in surveillance, human–computer interaction, and healthcare. Traditional approaches often use broad feature representations, which can miss subtle variations in timing and movement within action sequences. Our proposed One-to-Many Hierarchical Contrastive Learning (OTM-HC) framework maps the input into multi-layered feature vectors, creating a hierarchical contrastive representation that captures multiple granularities across the temporal and spatial domains of a human skeleton sequence. Using sequence-to-sequence (Seq2Seq) transformer encoders and downsampling modules, OTM-HC distinguishes between multiple levels of action representation: instance, domain, clip, and part. Each level contributes to a comprehensive understanding of the action. The OTM-HC design is adaptable and integrates readily with advanced Seq2Seq encoders. We tested the framework on four datasets, where it improved on state-of-the-art models by 0.9% and 0.6% on NTU60, 0.4% and 0.7% on NTU120, and 0.7% and 0.3% on PKU-MMD I and II, respectively. These results demonstrate the robustness and adaptability of our model for skeleton-based action recognition tasks.
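The idea of contrasting representations at several granularities can be illustrated with a small sketch. The code below is not the authors' implementation: it omits the Seq2Seq transformer encoders entirely, covers only temporal granularity (roughly, clip and instance levels), and uses average pooling as a stand-in for the downsampling modules. The function names `avg_pool`, `info_nce`, and `hierarchical_loss`, the temperature value, and the toy data are all assumptions made for illustration. It applies a standard InfoNCE loss at each temporal resolution and averages the results.

```python
import math

def avg_pool(seq, factor):
    """Average non-overlapping windows of `factor` frames (temporal downsampling)."""
    dim = len(seq[0])
    pooled = []
    for start in range(0, len(seq) - factor + 1, factor):
        window = seq[start:start + factor]
        pooled.append([sum(f[d] for f in window) / factor for d in range(dim)])
    return pooled

def cosine(a, b):
    """Cosine similarity between two feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm > 0 else 0.0

def info_nce(queries, keys, temperature=0.1):
    """InfoNCE: queries[i] should match keys[i]; all other keys act as negatives."""
    total = 0.0
    for i, q in enumerate(queries):
        logits = [cosine(q, k) / temperature for k in keys]
        m = max(logits)  # subtract the max for a numerically stable log-sum-exp
        log_denom = m + math.log(sum(math.exp(l - m) for l in logits))
        total += log_denom - logits[i]
    return total / len(queries)

def hierarchical_loss(batch_a, batch_b, factors, temperature=0.1):
    """Average InfoNCE over several temporal granularities (assumed scheme)."""
    losses = []
    for factor in factors:
        # Flatten every pooled unit (clip or whole sequence) across the batch.
        units_a = [u for seq in batch_a for u in avg_pool(seq, factor)]
        units_b = [u for seq in batch_b for u in avg_pool(seq, factor)]
        losses.append(info_nce(units_a, units_b, temperature))
    return sum(losses) / len(losses)

# Toy per-frame features: 2 sequences, 4 frames each, 2 feature dimensions.
seq0 = [[1, 0], [1, 0], [0, 1], [0, 1]]
seq1 = [[-1, 0], [-1, 0], [0, -1], [0, -1]]
batch = [seq0, seq1]

# factor=2 contrasts two-frame clips; factor=4 contrasts whole sequences.
aligned = hierarchical_loss(batch, batch, factors=[2, 4])
swapped = hierarchical_loss(batch, [seq1, seq0], factors=[2, 4])
print(aligned < swapped)
```

With matching views the loss stays near zero at both levels, while swapping the two sequences in the second view makes every positive pair dissimilar and the loss rises sharply. A real system would replace the raw toy features with encoder outputs and add the spatial (part and domain) levels described in the abstract.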


Journal: AI
Publication Date: Nov 1, 2024
License type: CC BY 4.0

