Action Recognition Method Research Articles

Vision-based human activity recognition has become a crucial research field in video analytics. Over the last decade, numerous deep learning algorithms have been developed to detect complex human actions in video streams. Although these algorithms achieve impressive recognition performance, they tend to favor either model performance or computational efficiency, and this biased trade-off between robustness and efficiency poses challenges for complex human activity recognition problems. To address this issue, this paper presents a computationally efficient yet robust approach that exploits saliency-aware spatial and temporal features for human action recognition in videos. To represent human actions effectively, we propose the dual-attentional Residual 3D Convolutional Neural Network (DA-R3DCNN). The method uses a unified channel-spatial attention mechanism to extract significant human-centric features from video frames efficiently. Combining dual channel-spatial attention layers with residual 3D convolution layers makes the network more selective about spatial receptive fields that contain objects within the feature maps. To assess the effectiveness and robustness of the proposed method, we conducted extensive experiments on four well-established benchmark datasets for human action recognition. The quantitative results show accuracy improvements of up to 11% over state-of-the-art human action recognition methods. In addition, our inference-time evaluation shows that the proposed method achieves up to a 74× improvement in frames per second (FPS) over existing approaches, demonstrating the suitability of DA-R3DCNN for real-time human activity recognition.
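
The channel-spatial attention idea mentioned in this abstract can be illustrated with a small sketch. This is not the DA-R3DCNN layer itself: the abstract does not give its formulation, so the code below assumes a simple channel-then-spatial gating built from plain averages and sigmoids, with no learned weights, applied to a single 2D feature map. The function name `channel_spatial_attention` and the input layout (nested lists indexed as channel, row, column) are hypothetical.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def channel_spatial_attention(fmap):
    """Gate a C x H x W feature map by channel, then by spatial location.

    Assumption for illustration only: both gates are sigmoids of plain
    averages. A real attention layer would learn these gates.
    """
    C, H, W = len(fmap), len(fmap[0]), len(fmap[0][0])
    # Channel gate: one weight per channel from its global average activation.
    ch_gate = [sigmoid(sum(sum(row) for row in ch) / (H * W)) for ch in fmap]
    scaled = [[[v * ch_gate[c] for v in row] for row in fmap[c]]
              for c in range(C)]
    # Spatial gate: one weight per location from the mean across channels.
    sp_gate = [[sigmoid(sum(scaled[c][i][j] for c in range(C)) / C)
                for j in range(W)] for i in range(H)]
    return [[[scaled[c][i][j] * sp_gate[i][j] for j in range(W)]
             for i in range(H)] for c in range(C)]

# Toy input: one channel with a single strongly activated location.
fmap = [[[1.0, 1.0], [1.0, 5.0]]]
out = channel_spatial_attention(fmap)
```

Because both gates lie strictly between 0 and 1, every activation is attenuated, but the strongly activated location keeps a larger fraction of its value than the weak ones, which is the "more selective" behavior the abstract describes.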

Vision-based human activity recognition (HAR) has emerged as one of the essential research areas in video analytics. Over the last decade, numerous advanced deep learning algorithms have been introduced to recognize complex human actions from video streams, and they have shown impressive performance on this task. However, these methods focus exclusively on either model performance or computational efficiency, resulting in a biased trade-off between robustness and computational efficiency when dealing with challenging HAR problems. To improve both accuracy and computational efficiency, this paper presents a computationally efficient yet generic spatial-temporal cascaded framework that exploits deep discriminative spatial and temporal features for HAR. For efficient representation of human actions, we propose a dual attentional convolutional neural network (DA-CNN) architecture that leverages a unified channel-spatial attention mechanism to extract human-centric salient features from video frames. The dual channel-spatial attention layers, together with the convolutional layers, learn to be more selective about the spatial receptive fields that contain objects within the feature maps. The extracted discriminative salient features are then forwarded to a stacked bi-directional gated recurrent unit (Bi-GRU) for long-term temporal modeling and recognition of human actions using both forward and backward pass gradient learning. Extensive experiments on three publicly available human action datasets verify the effectiveness of the proposed framework (DA-CNN+Bi-GRU) over state-of-the-art methods in terms of model accuracy and inference runtime on each dataset. The experimental results show that the DA-CNN+Bi-GRU framework improves execution time by up to 167× in frames per second compared with most contemporary action-recognition methods.
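
To make the bi-directional temporal modeling step more concrete, here is a minimal sketch of one bi-directional GRU pass over a sequence of per-frame features. It is a simplification under stated assumptions, not the paper's stacked Bi-GRU: the input and hidden state are scalars, there is a single layer, bias terms are omitted, and the weights in `params` are hypothetical fixed values rather than learned ones.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def gru_step(x, h, p):
    """One GRU update for a scalar input x and scalar hidden state h."""
    z = sigmoid(p["wz"] * x + p["uz"] * h)             # update gate
    r = sigmoid(p["wr"] * x + p["ur"] * h)             # reset gate
    cand = math.tanh(p["wh"] * x + p["uh"] * (r * h))  # candidate state
    return (1.0 - z) * h + z * cand

def bi_gru(seq, fwd, bwd):
    """Return one (forward, backward) hidden-state pair per time step."""
    h, forward = 0.0, []
    for x in seq:
        h = gru_step(x, h, fwd)
        forward.append(h)
    h, backward = 0.0, []
    for x in reversed(seq):
        h = gru_step(x, h, bwd)
        backward.append(h)
    backward.reverse()  # realign so backward[t] summarizes steps t..end
    return list(zip(forward, backward))

# Hypothetical fixed weights; in a trained model these are learned.
params = {"wz": 0.5, "uz": 0.5, "wr": 0.5, "ur": 0.5, "wh": 1.0, "uh": 1.0}
features = [0.2, -0.1, 0.7, 0.4]  # stand-in for per-frame feature values
states = bi_gru(features, params, params)
```

Each time step ends up with a forward state that summarizes the frames seen so far and a backward state that summarizes the frames still to come, so a classifier reading `states` has access to context from both directions.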

Articles published on Action Recognition Method

Retracted: Dance-Specific Action Recognition Method Based on Double-Stream CNN in Complex Environment.

Tell me what you see: A zero-shot action recognition method based on natural language descriptions

Human activity recognition method using joint deep learning and acceleration signal

Weakly Supervised Temporal Convolutional Networks for Fine-Grained Surgical Activity Recognition.

SKELTER: unsupervised skeleton action denoising and recognition using transformers

An effective PoseC3D model for typical action recognition of dairy cows based on skeleton features

Retracted: Study about Football Action Recognition Method Based on Deep Learning and Improved Dynamic Time Warping Algorithm

3D-unified spatial-temporal graph for group activity recognition

Wearable sensor-based human activity recognition with ensemble learning: a comparison study

A Human Activity Recognition Method Based on Lightweight Feature Extraction Combined With Pruned and Quantized CNN for Wearable Device

Human Action Representation Learning Using an Attention-Driven Residual 3DCNN Network

Enhanced Spatial Stream of Two-Stream Network Using Optical Flow for Human Action Recognition

Advancing human action recognition: A hybrid approach using attention-based LSTM and 3D CNN

Two-Person Graph Convolutional Network for Skeleton-Based Human Interaction Recognition

CHAN: Skeleton based action recognition by multi‐level feature learning

Spatial Hard Attention Modeling via Deep Reinforcement Learning for Skeleton-Based Human Activity Recognition

Action Recognition via Adaptive Semi-Supervised Feature Analysis

Novel Motion Patterns Matter for Practical Skeleton-Based Action Recognition

Human Activity Recognition Using Cascaded Dual Attention CNN and Bi-Directional GRU Framework.

Hang-Time HAR: A Benchmark Dataset for Basketball Activity Recognition Using Wrist-Worn Inertial Sensors.
