Human Action Segmentation Research Articles

Human action segmentation and recognition from the continuous untrimmed sensor data stream is a challenging issue known as temporal action detection. This article provides a two-stream You Only Look Once-based network method, which fuses video and skeleton streams captured by a Kinect sensor, and our data encoding method is used to turn the spatiotemporal temporal action detection into a one-dimensional object detection problem in constantly augmented feature space. The proposed approach extracts spatial–temporal three-dimensional convolutional neural network features from video stream and view-invariant features from skeleton stream, respectively. Furthermore, these two streams are encoded into three-dimensional feature spaces, which are represented as red, green, and blue images for subsequent network input. We proposed the two-stream You Only Look Once-based networks which are capable of fusing video and skeleton information by using the processing pipeline to provide two fusion strategies, boxes-fusion or layers-fusion. We test the temporal action detection performance of two-stream You Only Look Once network based on our data set High-Speed Interplanetary Tug/Cocoon Vehicles-v1, which contains seven activities in the home environment and achieve a particularly high mean average precision. We also test our model on the public data set PKU-MMD that contains 51 activities, and our method also has a good performance on this data set. To prove that our method can work efficiently on robots, we transplanted it to the robotic platform and an online fall down detection experiment.

Read full abstract

Visual analysis of human behavior has attracted a great deal of attention in the field of computer vision because of the wide variety of potential applications. Human behavior can be segmented into atomic actions, each of which indicates a single, basic movement. To reduce human intervention in the analysis of human behavior, unsupervised learning may be more suitable than supervised learning. However, the complex nature of human behavior analysis makes unsupervised learning a challenging task. In this paper, we propose a framework for the unsupervised analysis of human behavior based on manifold learning. First, a pairwise human posture distance matrix is derived from a training action sequence. Then, the isometric feature mapping (Isomap) algorithm is applied to construct a low-dimensional structure from the distance matrix. Consequently, the training action sequence is mapped into a manifold trajectory in the Isomap space. To identify the break points between the trajectories of any two successive atomic actions, we represent the manifold trajectory in the Isomap space as a time series of low-dimensional points. A temporal segmentation technique is then applied to segment the time series into sub series, each of which corresponds to an atomic action. Next, the dynamic time warping (DTW) approach is used to cluster atomic action sequences. Finally, we use the clustering results to learn and classify atomic actions according to the nearest neighbor rule. If the distance between the input sequence and the nearest mean sequence is greater than a given threshold, it is regarded as an unknown atomic action. Experiments conducted on real data demonstrate the effectiveness of the proposed method.

Read full abstract

Human Action Segmentation Research Articles

Articles published on Human Action Segmentation

SigFormer: Sparse Signal-Guided Transformer for Multi-Modal Action Segmentation

A motion-aware and temporal-enhanced Spatial–Temporal Graph Convolutional Network for skeleton-based human action segmentation

Continuous Human Action Recognition for Human-machine Interaction: A Review

Weakly supervised coarse-to-fine learning for human action segmentation in HCI videos

Spatial Focus Attention for Fine-Grained Skeleton-Based Action Tasks

Touching events predict human action segmentation in brain and behavior

Temporal action detection based on two-stream You Only Look Once network for elderly care service robot

Fine-grained action segmentation using the semi-supervised action GAN

A discriminative structural model for joint segmentation and recognition of human actions

Human Action Segmentation Based on a Streaming Uniform Entropy Slice Method

Continuous Motion Classification and Segmentation Based on Improved Dynamic Time Warping Algorithm

Manifold Warp Segmentation of Human Action.

Inferring action structure and causal relationships in continuous sequences of human action

Trajectory-based human action segmentation

Structured Time Series Analysis for Human Action Segmentation and Recognition.

Simultaneous segmentation and classification of human actions in video streams using deeply optimized Hough transform

Segmentation and Classification of Human Actions and Actor Characteristics with 3d Motion Data

Human action segmentation and classification based on the Isomap algorithm

Human action segmentation and recognition via motion and shape analysis

ATOMIC HUMAN ACTION SEGMENTATION AND RECOGNITION USING A SPATIO-TEMPORAL PROBABILISTIC FRAMEWORK

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Human Action Segmentation Research Articles

Articles published on Human Action Segmentation

SigFormer: Sparse Signal-Guided Transformer for Multi-Modal Action Segmentation

A motion-aware and temporal-enhanced Spatial–Temporal Graph Convolutional Network for skeleton-based human action segmentation

Continuous Human Action Recognition for Human-machine Interaction: A Review

Weakly supervised coarse-to-fine learning for human action segmentation in HCI videos

Spatial Focus Attention for Fine-Grained Skeleton-Based Action Tasks

Touching events predict human action segmentation in brain and behavior

Temporal action detection based on two-stream You Only Look Once network for elderly care service robot

Fine-grained action segmentation using the semi-supervised action GAN

A discriminative structural model for joint segmentation and recognition of human actions

Human Action Segmentation Based on a Streaming Uniform Entropy Slice Method

Continuous Motion Classification and Segmentation Based on Improved Dynamic Time Warping Algorithm

Manifold Warp Segmentation of Human Action.

Inferring action structure and causal relationships in continuous sequences of human action

Trajectory-based human action segmentation

Structured Time Series Analysis for Human Action Segmentation and Recognition.

Simultaneous segmentation and classification of human actions in video streams using deeply optimized Hough transform

Segmentation and Classification of Human Actions and Actor Characteristics with 3d Motion Data

Human action segmentation and classification based on the Isomap algorithm

Human action segmentation and recognition via motion and shape analysis

ATOMIC HUMAN ACTION SEGMENTATION AND RECOGNITION USING A SPATIO-TEMPORAL PROBABILISTIC FRAMEWORK