Frame Motion Research Articles

Self-supervised learning has demonstrated remarkable capability in representation learning for skeleton-based action recognition. Existing methods mainly focus on applying global data augmentation to generate different views of the skeleton sequence for contrastive learning. However, due to the rich action clues in the skeleton sequences, existing methods may only take a global perspective to learn to discriminate different skeletons without thoroughly leveraging the local relationship between different skeleton joints and video frames, which is essential for real-world applications. In this work, we propose a Partial Spatio-Temporal Learning (PSTL) framework to exploit the local relationship from a partial skeleton sequences built by a unique spatio-temporal masking strategy. Specifically, we construct a negative-sample-free triplet steam structure that is composed of an anchor stream without any masking, a spatial masking stream with Central Spatial Masking (CSM), and a temporal masking stream with Motion Attention Temporal Masking (MATM). The feature cross-correlation matrix is measured between the anchor stream and the other two masking streams, respectively. (1) Central Spatial Masking discards selected joints from the feature calculation process, where the joints with a higher degree of centrality have a higher possibility of being selected. (2) Motion Attention Temporal Masking leverages the motion of action and remove frames that move faster with a higher possibility. Our method achieves state-of-the-art performance on NTURGB+D 60, NTURGB+D 120 and PKU-MMD under various downstream tasks. Furthermore, to simulate the real-world scenarios, a practical evaluation is performed where some skeleton joints are lost in downstream tasks.In contrast to previous methods that suffer from large performance drops, our PSTL can still achieve remarkable results under this challenging setting, validating the robustness of our method.

Read full abstract

The coordinate frames for color and motion are often defined by three dimensions (e.g., responses from the three types of human cone photoreceptors for color and the three dimensions of space for motion). Does this common dimensionality lead to similar perceptual representations? Here we show that the organizational principles for the representation of hue and motion direction are instead profoundly different. We compared observers' judgments of hue and motion direction using functionally equivalent stimulus metrics, behavioral tasks, and computational analyses, and used the pattern of individual differences to decode the underlying representational structure for these features. Hue judgments were assessed using a standard "hue-scaling" task (i.e., judging the proportion of red/green and blue/yellow in each hue). Motion judgments were measured using a "motion-scaling" task (i.e., judging the proportion of left/right and up/down motion in moving dots). Analyses of the interobserver variability in hue scaling revealed multiple independent factors limited to different local regions of color space. This is inconsistent with the influences across a broad range of hues predicted by conventional color-opponent models. In contrast, variations in motion scaling were characterized by more global factors plausibly related to variation in the relative weightings of the cardinal spatial axes. These results suggest that although the coordinate frames for specifying color and motion share a common dimensional structure, the perceptual coding principles for hue and motion direction are distinct. These differences might reflect a distinction between the computational strategies required for the visual analysis of spatial vs. nonspatial attributes of the world.

Read full abstract

Frame Motion Research Articles

Related Topics

Articles published on Frame Motion

Non-Uniform Motion Aggregation with Graph Convolutional Networks for Skeleton-Based Human Action Recognition

Research on Three-Dimensional Shape Curve Reconstruction Technology for a Scraper Conveyor on an Intelligent Working Face.

Ai-Based Building Security System Using Vision Tracking Motion Method

Automatic highlight detection in videos of martial arts tricking

Lagrangian reduction and wave mean flow interaction

3D Information Guided Motion Transfer via Sequential Image Based Human Model Refinement and Face-Attention GAN

Self-Supervised Action Representation Learning from Partial Spatio-Temporal Skeleton Sequences

A New Surveillance and Security Alert System Based on Real-Time Motion Detection

Multimodal dance style transfer

Research on effects of rolling motion on a pressurizer surge line based on the fluid-solid-thermal coupling method

Dynamic machine vision with retinomorphic photomemristor-reservoir computing

Considerations on the Relativity of Quantum Irrealism

ROTATION MINIMIZING SPHERICAL MOTIONS AND HELICES

Continuous frame motion sensitive self-supervised collaborative network for video representation learning

Hierarchical Motion Excitation Network for Few-Shot Video Recognition

Lightweight Deep Neural Network Embedded with Stochastic Variational Inference Loss Function for Fast Detection of Human Postures.

HGRBOL2: Human gait recognition for biometric application using Bayesian optimization and extreme learning machine

Fundamentally different representations of color and motion revealed by individual differences in perceptual scaling

Biomac3D: 2D-to-3D Human Pose Analysis Model for Tele-Rehabilitation Based on Pareto Optimized Deep-Learning Architecture

A Deep Learning Method for Motion Artifact Correction in Intravascular Photoacoustic Image Sequence.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Frame Motion Research Articles

Related Topics

Articles published on Frame Motion

Non-Uniform Motion Aggregation with Graph Convolutional Networks for Skeleton-Based Human Action Recognition

Research on Three-Dimensional Shape Curve Reconstruction Technology for a Scraper Conveyor on an Intelligent Working Face.

Ai-Based Building Security System Using Vision Tracking Motion Method

Automatic highlight detection in videos of martial arts tricking

Lagrangian reduction and wave mean flow interaction

3D Information Guided Motion Transfer via Sequential Image Based Human Model Refinement and Face-Attention GAN

Self-Supervised Action Representation Learning from Partial Spatio-Temporal Skeleton Sequences

A New Surveillance and Security Alert System Based on Real-Time Motion Detection

Multimodal dance style transfer

Research on effects of rolling motion on a pressurizer surge line based on the fluid-solid-thermal coupling method

Dynamic machine vision with retinomorphic photomemristor-reservoir computing

Considerations on the Relativity of Quantum Irrealism

ROTATION MINIMIZING SPHERICAL MOTIONS AND HELICES

Continuous frame motion sensitive self-supervised collaborative network for video representation learning

Hierarchical Motion Excitation Network for Few-Shot Video Recognition

Lightweight Deep Neural Network Embedded with Stochastic Variational Inference Loss Function for Fast Detection of Human Postures.

HGRBOL2: Human gait recognition for biometric application using Bayesian optimization and extreme learning machine

Fundamentally different representations of color and motion revealed by individual differences in perceptual scaling

Biomac3D: 2D-to-3D Human Pose Analysis Model for Tele-Rehabilitation Based on Pareto Optimized Deep-Learning Architecture

A Deep Learning Method for Motion Artifact Correction in Intravascular Photoacoustic Image Sequence.