Action Recognition Method Research Articles

The importance of monitoring the activities of construction equipment for evaluating their productivity has resulted in the development of many vision-based automated monitoring methods. The state-of-the-art construction equipment activity recognition methods are based on the supervised learning approach that requires large, labeled datasets for each equipment and activity. Recently, many self-supervised deep learning methods have been proposed, which exploit the abundant unlabeled data to alleviate the data annotation cost by creating labels from the input data itself. However, the assumption of availability of abundant unlabeled data limits the applicability of self-supervised methods in the area of construction equipment activity recognition. To address these problems, in this work we propose CVRLoLD, which stands for Contrastive Video Representation Learning on Limited Dataset. CVRLoLD is a self-supervised contrastive learning approach that can successfully learn to recognize construction equipment activities on a limited dataset while only a portion of the dataset is annotated. The objectives of this work are: (1) proposing a novel self-supervised method for excavator activity recognition, and (2) improving the applicability of the self-supervised learning method on the relatively small datasets available for construction equipment activity recognition. Initially, the proposed method trains a backbone network using contrastive learning on the unlabeled data. Afterwards, the labeled data are used to fine-tune the pretrained backbone. The proposed method achieved an activity recognition accuracy of 81.7% while using only 30% of the labels in the dataset. The results demonstrate the potential of the proposed method for reducing the time and efforts required for data labeling while achieving high performance on the relatively limited datasets available in the construction domain.

With the development of multimedia technologies, surveillance videos and other multimedia data have received widespread attention in several fields. Surveillance videos can monitor students' learning statuses in real time. However, the current action recognition methods for teaching have limitations. First, the ethical privacy of AI and education makes public datasets on student behavior scarce. Therefore, based on the summarization of seven typical student behaviors in the classroom, course videos were obtained from the smart classroom to generate a dataset of student behavior. Compared with existing student behavior recognition datasets, the proposed dataset is distinguished by cluttered backgrounds, crowded scenes, and occlusions. Second, relational reasoning using existing methods is not ideal for distinguishing between students' body parts and small objects in a cluttered background; the interactive utilization rate of different relational features is low, and it cannot take advantage of the complementarity of different relational features, resulting in poor performance of interaction action recognition. Therefore, the attention-based relational reasoning module strengthens the interactive representation between small objects and human body parts. At the same time, considering that there is a certain complementary relationship between relational features, this study constructs a relational feature fusion module which models a human-to-human interaction relationship built upon supporting human's body part and surrounding context. Finally, the reconstructed features and human-appearance features were fused to achieve accurate interactive action recognition. Through an experimental comparison between the proposed and current mainstream algorithms on the generated student behavior dataset, it was verified that the proposed model achieves state-of-the-art performance in action recognition.

Action Recognition Method Research Articles

Related Topics

Articles published on Action Recognition Method

Self-supervised contrastive video representation learning for construction equipment activity recognition on limited dataset

Towards Recognition of Human Actions in Collaborative Tasks with Robots: Extending Action Recognition with Tool Recognition Methods.

Student behavior recognition for interaction detection in the classroom environment

Real-time machine learning-based recognition of human thermal comfort-related activities using inertial measurement unit data

A Novel Skeleton-Based Human Activity Discovery Using Particle Swarm Optimization With Gaussian Mutation

Human Recognization Activity and Maximum Motion Representation in Surveillance Video

Human Activity Recognition Method Based on FMCW Radar Sensor with Multi-Domain Feature Attention Fusion Network.

Optimized deep learning-based cricket activity focused network and medium scale benchmark

Action Capsules: Human skeleton action recognition

Spatial–Temporal Self-Attention Enhanced Graph Convolutional Networks for Fitness Yoga Action Recognition

A Data Augmentation Method for Human Activity Recognition Based on mmWave Radar Point Cloud

Unsupervised Video-Based Action Recognition With Imagining Motion and Perceiving Appearance

Improving state estimation through projection post-processing for activity recognition with application to football

Intelligent Video Analytics for Human Action Recognition: The State of Knowledge.

HiTIM: Hierarchical Task Information Mining for Few-Shot Action Recognition

A new vehicle specific power method based on internally observable variables: Application to CO2 emission assessment for a hybrid electric vehicle

A review of video action recognition based on 3D convolution

A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition.

Action Recognition Based on 3D Skeleton and LSTM for the Monitoring of Construction Workers’ Safety Harness Usage

Skeleton-Based Multifeatures and Multistream Network for Real-Time Action Recognition

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Action Recognition Method Research Articles

Related Topics

Articles published on Action Recognition Method

Self-supervised contrastive video representation learning for construction equipment activity recognition on limited dataset

Towards Recognition of Human Actions in Collaborative Tasks with Robots: Extending Action Recognition with Tool Recognition Methods.

Student behavior recognition for interaction detection in the classroom environment

Real-time machine learning-based recognition of human thermal comfort-related activities using inertial measurement unit data

A Novel Skeleton-Based Human Activity Discovery Using Particle Swarm Optimization With Gaussian Mutation

Human Recognization Activity and Maximum Motion Representation in Surveillance Video

Human Activity Recognition Method Based on FMCW Radar Sensor with Multi-Domain Feature Attention Fusion Network.

Optimized deep learning-based cricket activity focused network and medium scale benchmark

Action Capsules: Human skeleton action recognition

Spatial–Temporal Self-Attention Enhanced Graph Convolutional Networks for Fitness Yoga Action Recognition

A Data Augmentation Method for Human Activity Recognition Based on mmWave Radar Point Cloud

Unsupervised Video-Based Action Recognition With Imagining Motion and Perceiving Appearance

Improving state estimation through projection post-processing for activity recognition with application to football

Intelligent Video Analytics for Human Action Recognition: The State of Knowledge.

HiTIM: Hierarchical Task Information Mining for Few-Shot Action Recognition

A new vehicle specific power method based on internally observable variables: Application to CO2 emission assessment for a hybrid electric vehicle

A review of video action recognition based on 3D convolution

A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition.

Action Recognition Based on 3D Skeleton and LSTM for the Monitoring of Construction Workers’ Safety Harness Usage

Skeleton-Based Multifeatures and Multistream Network for Real-Time Action Recognition