Articles published on Activity Recognition
Authors
Select Authors
Journals
Select Journals
Duration
Select Duration
14443 Search results
Sort by Recency
- New
- Research Article
- 10.1016/j.neunet.2025.108508
- May 1, 2026
- Neural networks : the official journal of the International Neural Network Society
- Ying Wu + 4 more
ASR-GCN: Adaptive spatial information reconstruction GCN for skeleton-based action recognition.
- New
- Research Article
- 10.1016/j.ejmech.2026.118761
- May 1, 2026
- European journal of medicinal chemistry
- Wenwu Liu + 7 more
Cyclopropyl as a versatile tool in the development of kinase-targeted therapeutics.
- New
- Research Article
- 10.1016/j.neucom.2026.133091
- May 1, 2026
- Neurocomputing
- Shaocan Liu + 4 more
FMFNet: A Faster Multimodal Fusion Network for action recognition via efficient modality compensation
- New
- Research Article
- 10.1016/j.engappai.2026.114251
- May 1, 2026
- Engineering Applications of Artificial Intelligence
- Rahmat Ali + 3 more
Continuous Learning Enabled Dual Attention-based Fairness-aware eXplainable Convolutional Network for bias mitigation in a critical decision system for human action recognition
- New
- Research Article
- 10.1016/j.asoc.2026.114835
- May 1, 2026
- Applied Soft Computing
- Chenshuang Li + 11 more
TE-STGCN: Topology enhanced spatio-temporal graph convolutional network for skeleton-based action recognition
- New
- Research Article
- 10.1016/j.patcog.2025.112897
- May 1, 2026
- Pattern Recognition
- Ziliang Ren + 4 more
Multimodal alignment of event and text streams in spiking neural networks for human action recognition
- New
- Research Article
1
- 10.1109/tpami.2026.3651319
- May 1, 2026
- IEEE transactions on pattern analysis and machine intelligence
- Hao Dong + 6 more
Domain adaptation and generalization are crucial for real-world applications, such as autonomous driving and medical imaging where the model must operate reliably across environments with distinct data distributions. However, these tasks are challenging because the model needs to overcome various domain gaps caused by variations in, for example, lighting, weather, sensor configurations, and so on. Addressing domain gaps simultaneously in different modalities, known as multimodal domain adaptation and generalization, is even more challenging due to unique challenges in different modalities. Over the past few years, significant progress has been made in these areas, with applications ranging from action recognition to semantic segmentation, and more. Recently, the emergence of large-scale pre-trained multimodal foundation models, such as CLIP, has inspired numerous research studies, which leverage these models to enhance downstream adaptation and generalization. This survey summarizes recent advances in multimodal adaptation and generalization, particularly how these areas evolve from traditional approaches to foundation models. Specifically, this survey covers (1) multimodal domain adaptation, (2) multimodal test-time adaptation, (3) multimodal domain generalization, (4) domain adaptation and generalization with the help of multimodal foundation models, and (5) adaptation of multimodal foundation models. For each topic, we formally define the problem and give a thorough review of existing methods. Additionally, we analyze relevant datasets and applications, highlighting open challenges and potential future research directions.
- New
- Research Article
- 10.1016/j.measurement.2026.121284
- May 1, 2026
- Measurement
- Harun Sevinç + 1 more
A data-centric approach to radar-based human action recognition: SVD-based clutter removal and RTM/DTM feature fusion
- New
- Research Article
- 10.1016/j.eswa.2026.131132
- May 1, 2026
- Expert Systems with Applications
- Biao Yang + 2 more
Motion-aware multi-level feature fusion for few-shot action recognition
- New
- Research Article
- 10.1016/j.eswa.2026.131204
- May 1, 2026
- Expert Systems with Applications
- Wenming Cao + 2 more
Hierarchical joint contrastive learning with knowledge distillation for self-supervised 3D skeleton-based action recognition
- New
- Research Article
- 10.1016/j.neucom.2026.133172
- May 1, 2026
- Neurocomputing
- Qi Zhang + 6 more
An unsupervised open-set recognition method for user-independent human activity recognition
- New
- Research Article
- 10.1016/j.cosrev.2025.100879
- May 1, 2026
- Computer Science Review
- Mukesh Dalal + 1 more
A systematic review of deep learning-based models for elderly and human activity recognition
- New
- Research Article
- 10.1016/j.adhoc.2026.104192
- May 1, 2026
- Ad Hoc Networks
- Zhiwei Wen + 5 more
ESP-Fi HAR: A low-power WiFi CSI dataset for Ad-Hoc IoT human activity recognition
- New
- Research Article
- 10.1016/j.isci.2026.115252
- May 1, 2026
- iScience
- Qu Wang + 6 more
Pedestrian navigation activity recognition method based on two-stream transformer and contrastive learning.
- New
- Research Article
- 10.1016/j.bspc.2026.109667
- May 1, 2026
- Biomedical Signal Processing and Control
- Zhiyan Lin + 3 more
Adaptive temporal convolutional network with multi-head EMA-gated attention for continuous radar-based human activity recognition
- New
- Research Article
- 10.1016/j.pmcj.2026.102198
- May 1, 2026
- Pervasive and Mobile Computing
- Fahad Ayaz + 7 more
Benchmarking Radar Preprocessing Techniques and Transfer Learning Models for FMCW-based Human Activity Recognition
- New
- Research Article
- 10.22266/ijies2026.0430.70
- Apr 30, 2026
- International Journal of Intelligent Engineering and Systems
Ensuring occupational safety in shipyard welding demands reliable activity recognition under constrained postures, mechanical vibration, and electromagnetic interference.This study presents a wearable smartphone-based Welding Activity Recognition (WAR) framework employing 9-DoF inertial sensing and a deployment-oriented evaluation protocol.Triaxial accelerometer, gyroscope, and magnetometer signals from ten certified welders were processed using Kalman filtering and 2-second sliding-window segmentation with 50% overlap under a subjectindependent scheme.Five classifiers, SVM, LightGBM, CNN, LSTM, and a hybrid CNN-LSTM were comparatively evaluated.The proposed CNN-LSTM achieved the highest accuracy of 97.45% at 50 Hz, demonstrating effective spatio-temporal feature modeling for multi-class welding posture recognition.Latency was rigorously defined under a unified sensor-to-decision framework, separating buffering and computational components.Despite lower inference latency in classical models, the proposed architecture maintained computational latency within the steady-state streaming interval, ensuring real-time feasibility.Additional analyses of sensor placement, sampling rate, and magnetometer inclusion provide further substantiate robust deployment in industrial environments.
- New
- Research Article
- 10.22266/ijies2026.0430.29
- Apr 30, 2026
- International Journal of Intelligent Engineering and Systems
Human Activity Recognition (HAR) is emerging as a critical enabler of context-aware applications in healthcare, fitness, and smart environments.In this research we present an approach that involves the Hierarchical Fusion with Decision Enhancement for Human Activity Recognition (Hi-FuseDE-HAR) framework.It contains four sequential hierarchical levels that transform raw sensor signals into a reliable HAR decision.At the input level, heterogeneous data streams are obtained from multiple wearable and ambient sensors.Applying Level 0 build the discriminative latent embedding for each sensor modality separately.Level 1 fuses across sensors in groupings and determines the value of using each modality and determines its relative contribution to the overall feature importance.Level 2 applies a Graph Cross-Modal Transformer that learns relationship between sensors groups producing a globally consistent fused representation.Level 3 provides decision enhancement through uncertainty calibration and utility aware optimization to ensure the final estimates based.Experimental results indicate that the proposed framework achieves 97.6% accuracy and 96.7% F1-score on the PAMAP2 dataset, 95.5% accuracy and 93.2% F1-score on the OPPORTUNITY dataset, and 96.5% accuracy and 95.2% F1-score on the MHEALTH dataset respectively.Notably, Hi-FuseDE-HAR retains strong performance confirming its capability to generalize across varied sensor contexts and complex activity patterns.
- New
- Research Article
- 10.1142/s021951942650048x
- Apr 24, 2026
- Journal of Mechanics in Medicine and Biology
- Chenxi Lu + 1 more
To address the problem of low recognition rate caused by the difficulty in capturing highspeed and subtle movements in table tennis, this work proposes a motion recognition method based on multimodal data and an optimized Spatial-Temporal Graph Convolutional Network (ST-GCN). The model introduces a Multi-Level Graph Convolutional Network (ML-GCN) architecture and constructs cross-level feature extraction channels, which effectively capture the spatiotemporal correlations between local subtle movements and global trajectories. The built-in hybrid attention mechanism realizes precise focusing on key skeletal nodes and core motion frames through adaptive weight assignment. Combined with the multimodal fusion strategy of visual signals and inertial sensor data, it significantly enhances the robustness of the model in scenarios with line-of-sight occlusion and motion blur. Test results based on a self-built multimodal table tennis dataset show that this method achieves an accuracy of 88.2%, a recall rate of 89.5% and an F1-score of 88.3%. This performance is significantly superior to the original ST-GCN and existing mainstream motion recognition algorithms, which confirms the core role of each optimization module in improving feature representation capability and computational efficiency. The study provides an efficient technical solution for the intelligent analysis of complex sports movements.
- New
- Research Article
- 10.1142/s0219519426500454
- Apr 23, 2026
- Journal of Mechanics in Medicine and Biology
- Chuihu Yin + 2 more
In college physical education teaching, college students’ athletic ability assessment still faces problems of strong subjectivity in scoring and difficulty in quantifying complex movement characteristics. Traditional methods struggle to capture multi-granularity information of skeletons in continuous movements, and the cost of acquiring annotated data is high. This study aims to construct a set of quantitative assessment models for athletic ability oriented to college physical education classrooms. It realizes the joint optimization of action recognition and athletic ability scoring through the multi-granularity spatial-temporal graph convolutional network (MGSTGC). The MG-STGC model uses an encoder to extract joint-level, limb-level, and body-level features. It combines labeled and unlabeled data via semi-supervised learning strategies to achieve the joint optimization of action recognition and quantitative assessment of athletic ability. The athletic ability assessment module can generate continuous scores across four dimensions: strength, stability, standardization, and coordination. These scores are obtained through spatiotemporal statistical mapping of historical action segments and skeleton features, providing data support for individualized training. On the NTU RGB+D dataset, MG-STGC achieves a Top-1 accuracy of 95.6% and 89.7% on the X-view and X-sub benchmarks. On the FineGym dataset, it reaches 80.5% Top-1 accuracy on the Gym99 subset and 75.4% on the Gym288 subset, with category average accuracies of 69.8% and 62.6%, respectively, outperforming baseline models. Ablation experiments show that the granularity information fusion module and parameters have an impact on model performance. Research shows that MGSTGC can efficiently capture multi-granularity information of action skeletons and provide an objective and quantitative method for athletic ability assessment in college physical education classrooms. MG-STGC also lays a feasible theoretical and practical foundation for intelligent physical education teaching and personalized training.