Multi‐dimensional data modelling of video image action recognition and motion capture in deep learning framework

Peijun Gao,Xuanang Chen,Dan Zhao

doi:10.1049/iet-ipr.2019.0588

Peijun Gao, Xuanang Chen + Show 1 more

Open Access

PDF Available

https://doi.org/10.1049/iet-ipr.2019.0588

Copy DOI

Export

Save

Cite

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

In order to improve the accuracy of small-range human motion recognition in video and the computational efficiency of large-scale data sets, a multi-dimensional data model of motion recognition and motion capture in video image based on deep-learning framework was proposed. First, the moving foreground of the target is extracted by the Gauss mixture model, and the human body is recognised by the gradient histogram. At the second level, the dense trajectory feature and the deep learning feature are fused, according to the integration of global encoding algorithm and convolutional neural network. In the deep learning feature, the fusion of the deep video feature and the video RGB tricolour feature is taken as the feature of deep learning. Finally, the classification is based on the deep learning network model. The simulation experiments based on large-scale real data sets and small-scale gesture data sets show that the algorithm has high recognition accuracy for large-scale data sets and small-scale gesture actions. In addition, Imperial Computer Vision & Learning Lab human behaviour data set is used to classify the experimental data. The average classification accuracy is 85.79%. The algorithm can run at a speed of about 20 frames per second.

Full Text