Improving Small-Scale Human Action Recognition Performance Using a 3D Heatmap Volume.

Lin Yuan,Xiang Ma,Zhen He,Leiyang Xu,Qiang Wang

doi:10.3390/s23146364

Lin Yuan, Xiang Ma + Show 3 more

Open Access

https://doi.org/10.3390/s23146364

Copy DOI

Journal: Sensors (Basel, Switzerland)	Publication Date: Jul 13, 2023
Citations: 1	License type: CC BY 4.0

Affiliation: Harbin Institute of Technology

Abstract

In recent years, skeleton-based human action recognition has garnered significant research attention, with proposed recognition or segmentation methods typically validated on large-scale coarse-grained action datasets. However, there remains a lack of research on the recognition of small-scale fine-grained human actions using deep learning methods, which have greater practical significance. To address this gap, we propose a novel approach based on heatmap-based pseudo videos and a unified, general model applicable to all modality datasets. Leveraging anthropometric kinematics as prior information, we extract common human motion features among datasets through an ad hoc pre-trained model. To overcome joint mismatch issues, we partition the human skeleton into five parts, a simple yet effective technique for information sharing. Our approach is evaluated on two datasets, including the public Nursing Activities and our self-built Tai Chi Action dataset. Results from linear evaluation protocol and fine-tuned evaluation demonstrate that our pre-trained model effectively captures common motion features among human actions and achieves steady and precise accuracy across all training settings, while mitigating network overfitting. Notably, our model outperforms state-of-the-art models in recognition accuracy when fusing joint and limb modality features along the channel dimension.

Full Text