Human Action Recognition Method Research Articles

The recognition prerformance of existing vision-based human action recognition (HAR) methods is greatly reduced in the case of low camera resolution or occlusion. Wearable sensors can provide complementary information to alleviate this problem. It is challenging to construct a robust HAR model using multimodal wearable-sensor data. In this paper, we propose a cross-Attention-based Multimodal fusion Wavelet Knowledge Distillation Network (MAWKDN) method to guide recognition from video data by acquiring complementary information from wearable sensors and reduce the noise effects through wavelet knowledge distillation, which improves the robustness of the model. A multi-attention dilated convolution kernel residual network including dilated convolution and an attention mechanism is constructed to extract features from various sensor modalities and fuse the various modal data through the cross-view attention method to acquire additional information from different modalities. To reduce the modal differences between different modalities of the teacher and student networks and acquire similar semantic knowledge, we learn the information between different modalities by constructing a graph structure of convolutional layer features, and computing the semantic preservation loss between the teacher and student networks. To reduce the influence of noise in the input data, we construct the loss of wavelet knowledge distillation, which transforms the image through the discrete wavelet transform and only retains the low frequency features to extract the useful information. The top-1 accuracy achieved on the UTD-MHAD (99.31%), Berkeley-MHAD (99.40%) and the F1-score on the MMAct (85.26% based on cross-session) dataset prove the superior performance of MAWKDN compared with the state-of-the-art HAR methods. Moreover, we demonstrate the robustness of the MAWKDN approach on the noise-added UTD-MHAD dataset.

Read full abstract

The recognition of human activities using vision-based techniques has become a crucial research field in video analytics. Over the last decade, there have been numerous advancements in deep learning algorithms aimed at accurately detecting complex human actions in video streams. While these algorithms have demonstrated impressive performance in activity recognition, they often exhibit a bias towards either model performance or computational efficiency. This biased trade-off between robustness and efficiency poses challenges when addressing complex human activity recognition problems. To address this issue, this paper presents a computationally efficient yet robust approach, exploiting saliency-aware spatial and temporal features for human action recognition in videos. To achieve effective representation of human actions, we propose an efficient approach called the dual-attentional Residual 3D Convolutional Neural Network (DA-R3DCNN). Our proposed method utilizes a unified channel-spatial attention mechanism, allowing it to efficiently extract significant human-centric features from video frames. By combining dual channel-spatial attention layers with residual 3D convolution layers, the network becomes more discerning in capturing spatial receptive fields containing objects within the feature maps. To assess the effectiveness and robustness of our proposed method, we have conducted extensive experiments on four well-established benchmark datasets for human action recognition. The quantitative results obtained validate the efficiency of our method, showcasing significant improvements in accuracy of up to 11% as compared to state-of-the-art human action recognition methods. Additionally, our evaluation of inference time reveals that the proposed method achieves up to a 74× improvement in frames per second (FPS) compared to existing approaches, thus showing the suitability and effectiveness of the proposed DA-R3DCNN for real-time human activity recognition.

Read full abstract

Human Action Recognition Method Research Articles

Related Topics

Articles published on Human Action Recognition Method

Deep Wavelet Convolutional Neural Networks for Multimodal Human Activity Recognition Using Wearable Inertial Sensors.

A Novel Lightweight Human Activity Recognition Method Via L-CTCN.

Review of Literature on Human Activity Detection and Recognition

Towards efficient video-based action recognition: context-aware memory attention network

MAWKDN: A Multimodal Fusion Wavelet Knowledge Distillation Approach Based on Cross-View Attention for Action Recognition

Occlusion-Aware Graph Neural Networks for Skeleton Action Recognition

MMTSA

Human Activity Recognition Based on Continuous-Wave Radar and Bidirectional Gate Recurrent Unit

A multi-graph convolutional network based wearable human activity recognition method using multi-sensors

Human activity recognition method using joint deep learning and acceleration signal

SKELTER: unsupervised skeleton action denoising and recognition using transformers

Wearable sensor-based human activity recognition with ensemble learning: a comparison study

A Human Activity Recognition Method Based on Lightweight Feature Extraction Combined With Pruned and Quantized CNN for Wearable Device

Human Action Representation Learning Using an Attention-Driven Residual 3DCNN Network

Enhanced Spatial Stream of Two-Stream Network Using Optical Flow for Human Action Recognition

Hang-Time HAR: A Benchmark Dataset for Basketball Activity Recognition Using Wrist-Worn Inertial Sensors.

Towards Recognition of Human Actions in Collaborative Tasks with Robots: Extending Action Recognition with Tool Recognition Methods.

A Novel Skeleton-Based Human Activity Discovery Using Particle Swarm Optimization With Gaussian Mutation

Human Activity Recognition Method Based on FMCW Radar Sensor with Multi-Domain Feature Attention Fusion Network.

A Data Augmentation Method for Human Activity Recognition Based on mmWave Radar Point Cloud

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Human Action Recognition Method Research Articles

Related Topics

Articles published on Human Action Recognition Method

Deep Wavelet Convolutional Neural Networks for Multimodal Human Activity Recognition Using Wearable Inertial Sensors.

A Novel Lightweight Human Activity Recognition Method Via L-CTCN.

Review of Literature on Human Activity Detection and Recognition

Towards efficient video-based action recognition: context-aware memory attention network

MAWKDN: A Multimodal Fusion Wavelet Knowledge Distillation Approach Based on Cross-View Attention for Action Recognition

Occlusion-Aware Graph Neural Networks for Skeleton Action Recognition

MMTSA

Human Activity Recognition Based on Continuous-Wave Radar and Bidirectional Gate Recurrent Unit

A multi-graph convolutional network based wearable human activity recognition method using multi-sensors

Human activity recognition method using joint deep learning and acceleration signal

SKELTER: unsupervised skeleton action denoising and recognition using transformers

Wearable sensor-based human activity recognition with ensemble learning: a comparison study

A Human Activity Recognition Method Based on Lightweight Feature Extraction Combined With Pruned and Quantized CNN for Wearable Device

Human Action Representation Learning Using an Attention-Driven Residual 3DCNN Network

Enhanced Spatial Stream of Two-Stream Network Using Optical Flow for Human Action Recognition

Hang-Time HAR: A Benchmark Dataset for Basketball Activity Recognition Using Wrist-Worn Inertial Sensors.

Towards Recognition of Human Actions in Collaborative Tasks with Robots: Extending Action Recognition with Tool Recognition Methods.

A Novel Skeleton-Based Human Activity Discovery Using Particle Swarm Optimization With Gaussian Mutation

Human Activity Recognition Method Based on FMCW Radar Sensor with Multi-Domain Feature Attention Fusion Network.

A Data Augmentation Method for Human Activity Recognition Based on mmWave Radar Point Cloud