Abstract

Based on the 3D Poisson equation, this paper extracts the features of the digital video human body action sequence. By solving the Poisson equation on the silhouette sequence, the time and space features, time and space structure features, shape features, and orientation features can be obtained. First, we use the silhouette structure features in three-dimensional space-time and the orientation features of the silhouette in three-dimensional space-time to represent the local features of the silhouette sequence and use the 3D Zernike moment feature to represent the overall features of the silhouette sequence. Secondly, we combine the Bayesian classifier and AdaBoost classifier to learn and classify the features of human action sequences, conduct experiments on the Weizmann video database, and conduct multiple experiments using the method of classifying samples and selecting partial combinations for training. Then, using the recognition algorithm of motion capture, after the above process, the three-dimensional model is obtained and matched with the model in the three-dimensional model database, the sequence with the smallest distance is calculated, and the corresponding skeleton is outputted as the results of action capture. During the experiment, the human motion tracking method based on the university matching kernel (EMK) image kernel descriptor was used; that is, the scale invariant operator was used to count the characteristics of multiple training images, and finally, the high-dimensional feature space was mapped into the low-dimensional to obtain the feature space approximating the Gaussian kernel. Based on the above analysis, the main user has prior knowledge of the network environment. The experimental results show that the method in this paper can effectively extract the characteristics of human body movements and has a good classification effect for bending, one-foot jumping, vertical jumping, waving, and other movements. Due to the linear separability of the data in the kernel space, fast linear interpolation regression is performed on the features in the feature space, which significantly improves the robustness and accuracy of the estimation of the human motion pose in the image sequence.

Highlights

  • Obtaining and analyzing various parameters of human motion from video image information is a key research direction of multidisciplinary fusion

  • Due to the relative ease of acquisition and wide range of applications, many existing video human motion tracking studies are dedicated to tracking the moving human body from the image sequence acquired by a single camera

  • Based on the existing research, this paper has conducted an in-depth analysis of the motion capture algorithm and technology

Read more

Summary

Introduction

Obtaining and analyzing various parameters of human motion from video image information is a key research direction of multidisciplinary fusion. Due to the relative ease of acquisition and wide range of applications, many existing video human motion tracking studies are dedicated to tracking the moving human body from the image sequence acquired by a single camera Most of these methods are based on extracting different features of the human body from image frames for matching and tracking. The main purpose of visual analysis is to detect, identify, and track the human body from a set of image sequences containing people and to understand and describe its behavior. This process can be divided into the underlying vision module level vision, data fusion module, and high-level vision module. Placing the digital human body model in a virtual production environment can well solve the practical problems of human factor engineering such as the accessibility, safety analysis, and standardization of operating actions for workers in production and assembly operations

Related Work
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call