Abstract

In recent years, skeleton-based human action recognition (HAR) approaches using convolutional neural network (CNN) models have made tremendous progress in computer vision applications. However, depicting human actions with relative features, while preventing overfitting when the CNN model is trained on only a few samples, remains a challenge. In this paper, a new motion image is introduced to transform spatial-temporal motion information into image-based representations. For each skeleton sequence, three relative features are extracted to describe human actions: relative coordinates, immediate displacement, and immediate motion orientation. In particular, the relative coordinates introduced in this paper not only depict the spatial relations of human skeleton joints but also provide long-term temporal information. To address the problem of small sample sizes, a data augmentation strategy consisting of three simple but effective methods is proposed to expand the training set. Because the generated color images are small, a shallow CNN model is sufficient to extract deep features from the generated motion images. Two small-scale but challenging skeleton datasets were used to evaluate the method, which scored 96.59% on the Florence 3D Actions dataset and 97.48% on the UTKinect-Action3D dataset. The results show that the proposed method achieves competitive performance compared with state-of-the-art methods. Furthermore, the augmentation strategy proposed in this paper effectively alleviates overfitting and can be widely adopted in skeleton-based action recognition.
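To make the three relative features concrete, the following is a minimal sketch in Python/NumPy. The (T, J, 3) sequence layout and the choice of reference joint are assumptions for illustration, not the paper's exact definitions.

```python
import numpy as np

def relative_features(seq, ref_joint=0):
    """Sketch of the three relative features from a skeleton sequence.

    seq: (T, J, 3) array of T frames, J joints, 3D coordinates.
    ref_joint: index of the reference joint (e.g. hip center) -- an
        assumption; the paper's exact reference may differ.
    """
    # Relative coordinates: each joint expressed w.r.t. the reference joint.
    rel_coords = seq - seq[:, ref_joint:ref_joint + 1, :]

    # Immediate displacement: per-joint difference between consecutive frames.
    disp = seq[1:] - seq[:-1]                          # (T-1, J, 3)

    # Immediate motion orientation: unit vector of each displacement
    # (zero where the joint did not move, to avoid division by zero).
    norm = np.linalg.norm(disp, axis=-1, keepdims=True)
    orient = np.divide(disp, norm, out=np.zeros_like(disp), where=norm > 0)

    return rel_coords, disp, orient
```

Each of the three arrays can then be normalized to [0, 255] and stacked as color channels of a motion image, in the spirit of the image encoding the abstract describes.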

Highlights

  • In recent years, human action recognition (HAR) has received increasing attention in the field of computer vision because of its wide range of industrial applications, such as human-computer interaction, smart video surveillance, and health care [1]

  • According to [24,25], because the motion color image is small in size and simple in structure, a shallow convolutional neural network (CNN) framework is sufficient to extract the deep features of the generated motion images for HAR (see the sketch below)
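As an illustration of the shallow-network idea, here is a minimal sketch in PyTorch. The two-convolution-layer depth, channel widths, and 32×32 input size are assumptions for illustration, not the paper's exact architecture.

```python
import torch
import torch.nn as nn

class ShallowCNN(nn.Module):
    """Illustrative shallow CNN for small motion color images."""

    def __init__(self, num_classes):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 32, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.MaxPool2d(2),                       # 32x32 -> 16x16
            nn.Conv2d(32, 64, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.MaxPool2d(2),                       # 16x16 -> 8x8
        )
        self.classifier = nn.Linear(64 * 8 * 8, num_classes)

    def forward(self, x):
        x = self.features(x)                       # (N, 64, 8, 8)
        return self.classifier(torch.flatten(x, 1))
```

Because the input images are small, two convolutional stages already yield a compact feature map, which keeps the parameter count and the risk of overfitting low on small datasets.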

Summary

Introduction

HAR has received increasing attention in the field of computer vision. HAR using handcrafted features comprises two stages, feature extraction and feature representation, which together form the final feature descriptor. In the former stage, various kinds of motion features have been proposed, such as the relative coordinates of and angles between joints. To address the aforementioned issues, we propose a novel skeleton-based action recognition method using a CNN model. The spatial-temporal motion data of a human action are encoded into an image-based representation. To cope with varying-length skeleton sequences, an effective skeleton sequence refinement strategy is used to align action sequences; in this way, the generated motion images have a consistent spatial structure. A shallow CNN model is sufficient to efficiently extract deep features because of the small size of the proposed skeleton-based motion image.
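As an illustration of aligning varying-length sequences, the sketch below resamples a skeleton sequence to a fixed number of frames by linear interpolation along the time axis. This is one plausible refinement strategy under the stated assumptions; the paper's exact procedure may differ.

```python
import numpy as np

def align_sequence(seq, target_len=32):
    """Resample a varying-length skeleton sequence to a fixed length.

    seq: (T, J, 3) array of T frames; returns (target_len, J, 3).
    target_len is an assumed value; the paper may use a different one.
    """
    T = seq.shape[0]
    src = np.arange(T)                             # original frame indices
    dst = np.linspace(0, T - 1, target_len)        # resampled time points
    flat = seq.reshape(T, -1)                      # (T, J*3)
    # Interpolate each coordinate channel independently over time.
    out = np.stack([np.interp(dst, src, flat[:, k])
                    for k in range(flat.shape[1])], axis=1)
    return out.reshape(target_len, *seq.shape[1:])
```

Fixing the temporal length means every generated motion image has the same dimensions, which is what lets a single CNN consume them directly.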

Related Work
Method
Data Augmentation
Mimicking
Mimicking a Certain Action That Was Executed by People of Different Sizes
Feature Extraction
Relative Coordinate
Immediate Motion Orientation
Immediate Displacement
Image Encoding
Network Model
Experiment
Datasets
Comparison of Different Frames of Skeleton Sequence
Demonstrate Effectiveness of Data Augmentation
Comparison of Original Coordinates and Relative Coordinates
Comparison of the Normalization Function
Comparison of State-of-the-Art Methods
Methods
Discussion
Findings
Conclusions
