Abstract
Inferring human activities from the skeletons extracted from activity photos or videos is a fundamental yet important issue in the research community of computer vision. Current skeleton-based activity recognition methods have accomplished this task by extracting features in either spatial or temporal perspective, or combining both views; however, the extracted features lack further after-processing like feature augmentation. Aiming at the above shortcomings, this article presents a new human activity recognition approach based on human skeleton data, named Convolutional Block Attention Module-based Spatial Temporal Convolutional Network (CBAM-STGCN). To enhance the discriminative validity of the generated features, we leverage the attention mechanism to weight feature units within each dimension by introducing two attention modules designed in terms of different aspects into the model architecture. Specifically, the original feature map of an activity sample is adaptively refined by the convolutional block attention module, where two separate attention maps obtained with regard to spatial and channel dimensions are adopted to generate the corresponding attended feature maps. Evaluation results on the Kinetics benchmark dataset showed that the proposed model has improved 1.76 % and 2.43 % according to top-1 and top-5 metrics, respectively, compared to the conventional baseline model. Further ablation experiments proved that both channel and spatial attention modules contribute to the model performance.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.