Network For Action Recognition Research Articles

The advancements in intelligent action recognition can be instrumental in developing autonomous robotic systems capable of analyzing complex human activities in real-time, contributing to the growing field of robotics that operates in dynamic environments. The precise recognition of basketball players' actions using artificial intelligence technology can provide valuable assistance and guidance to athletes, coaches, and analysts, and can help referees make fairer decisions during games. However, unlike action recognition in simpler scenarios, the background in basketball is similar and complex, the differences between various actions are subtle, and lighting conditions are inconsistent, making action recognition in basketball a challenging task. To address this problem, an Adaptive Context-Aware Network (ACA-Net) for basketball player action recognition is proposed in this paper. It contains a Long Short-term Adaptive (LSTA) module and a Triplet Spatial-Channel Interaction (TSCI) module to extract effective features at the temporal, spatial, and channel levels. The LSTA module adaptively learns global and local temporal features of the video. The TSCI module enhances the feature representation by learning the interaction features between space and channels. We conducted extensive experiments on the popular basketball action recognition datasets SpaceJam and Basketball-51. The results show that ACA-Net outperforms the current mainstream methods, achieving 89.26% and 92.05% in terms of classification accuracy on the two datasets, respectively. ACA-Net's adaptable architecture also holds potential for real-world applications in autonomous robotics, where accurate recognition of complex human actions in unstructured environments is crucial for tasks such as automated game analysis, player performance evaluation, and enhanced interactive broadcasting experiences.

Most previous few-shot action recognition works tend to process video temporal and spatial features separately, resulting in insufficient extraction of comprehensive features. In this paper, a novel hybrid attentive prototypical network (HAPN) framework for few-shot action recognition is proposed. Distinguished by its joint processing of temporal and spatial information, the HAPN framework strategically manipulates these dimensions from feature extraction to the attention module, consequently enhancing its ability to perform action recognition tasks. Our framework utilizes the R(2+1)D backbone network, coupling the extraction of integrated temporal and spatial features to ensure a comprehensive understanding of video content. Additionally, our framework introduces the novel Residual Tri-dimensional Attention (ResTriDA) mechanism, specifically designed to augment feature information across the temporal, spatial, and channel dimensions. ResTriDA dynamically enhances crucial aspects of video features by amplifying significant channel-wise features for action distinction, accentuating spatial details vital for capturing the essence of actions within frames, and emphasizing temporal dynamics to capture movement over time. We further propose a prototypical attentive matching module (PAM) built on the concept of metric learning to resolve the overfitting issue common in few-shot tasks. We evaluate our HAPN framework on three classical few-shot action recognition datasets: Kinetics-100, UCF101, and HMDB51. The results indicate that our framework significantly outperformed state-of-the-art methods. Notably, the 1-shot task, demonstrated an increase of 9.8% in accuracy on UCF101 and improvements of 3.9% on HMDB51 and 12.4% on Kinetics-100. These gains confirm the robustness and effectiveness of our approach in leveraging limited data for precise action recognition.

Network For Action Recognition Research Articles

Related Topics

Articles published on Network For Action Recognition

Dynamic spatial-temporal topology graph network for skeleton-based action recognition

A GCN and Transformer complementary network for skeleton-based action recognition

Decoupled Knowledge Embedded Graph Convolutional Network for Skeleton-Based Human Action Recognition

Spiking Neural Networks for event-based action recognition: A new task to understand their advantage

ACA-Net: adaptive context-aware network for basketball action recognition.

Attention-Guided and Topology-Enhanced Shift Graph Convolutional Network for Skeleton-Based Action Recognition

Prompt-supervised dynamic attention graph convolutional network for skeleton-based action recognition

A novel multi-stream hand-object interaction network for assembly action recognition

Improved semantic-guided network for skeleton-based action recognition

Hybrid attentive prototypical network for few-shot action recognition

Cross-modal guides spatio-temporal enrichment network for few-shot action recognition

Variation-aware directed graph convolutional networks for skeleton-based action recognition

Multi-Scale Structural Graph Convolutional Network for Skeleton-Based Action Recognition

Human movement science-informed multi-task spatio temporal graph convolutional networks for fitness action recognition and evaluation

Automated Laryngeal Invasion Detector of Boluses in Videofluoroscopic Swallowing Study Videos Using Action Recognition-Based Networks.

Glimpse and Zoom: Spatio-Temporal Focused Dynamic Network for Skeleton-Based Action Recognition

PointDMIG: a dynamic motion-informed graph neural network for 3D action recognition

Multi-scale sampling attention graph convolutional networks for skeleton-based action recognition

Differential motion attention network for efficient action recognition

Spatial-temporal multiscale feature optimization based two-stream convolutional neural network for action recognition

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Network For Action Recognition Research Articles

Related Topics

Articles published on Network For Action Recognition

Dynamic spatial-temporal topology graph network for skeleton-based action recognition

A GCN and Transformer complementary network for skeleton-based action recognition

Decoupled Knowledge Embedded Graph Convolutional Network for Skeleton-Based Human Action Recognition

Spiking Neural Networks for event-based action recognition: A new task to understand their advantage

ACA-Net: adaptive context-aware network for basketball action recognition.

Attention-Guided and Topology-Enhanced Shift Graph Convolutional Network for Skeleton-Based Action Recognition

Prompt-supervised dynamic attention graph convolutional network for skeleton-based action recognition

A novel multi-stream hand-object interaction network for assembly action recognition

Improved semantic-guided network for skeleton-based action recognition

Hybrid attentive prototypical network for few-shot action recognition

Cross-modal guides spatio-temporal enrichment network for few-shot action recognition

Variation-aware directed graph convolutional networks for skeleton-based action recognition

Multi-Scale Structural Graph Convolutional Network for Skeleton-Based Action Recognition

Human movement science-informed multi-task spatio temporal graph convolutional networks for fitness action recognition and evaluation

Automated Laryngeal Invasion Detector of Boluses in Videofluoroscopic Swallowing Study Videos Using Action Recognition-Based Networks.

Glimpse and Zoom: Spatio-Temporal Focused Dynamic Network for Skeleton-Based Action Recognition

PointDMIG: a dynamic motion-informed graph neural network for 3D action recognition

Multi-scale sampling attention graph convolutional networks for skeleton-based action recognition

Differential motion attention network for efficient action recognition

Spatial-temporal multiscale feature optimization based two-stream convolutional neural network for action recognition