Abstract

This paper provides an in-depth discussion of human motion recognition in Virtual Reality (VR) video sequences through hidden Markov models, which are four steps from VR video acquisition and pre-processing, foreground detection, extraction of human feature parameters, and hidden Markov model human motion recognition. A hybrid Gaussian model was used to build a background model in real-time based on changes in VR video information, and the image was subtracted by the background differential method. The optical flow method was used for foreground detection of the target, and the effects of sparse and dense optical flow were compared to obtain the motion characteristics and optical flow information of the target human body, respectively. Features were extracted for human motion, in terms of common geometric features of the body and optical flow information, respectively. In terms of common geometric information, the width-to-height ratio, perimeter-to-area ratio, center of mass, eccentricity, and feature angle were extracted, respectively. For the optical flow information, optical flow descriptors were constructed using a grid-based approach. And feature fusion was performed for the above two parameters by the k-means method to construct the word pocket model. The hidden Markov model parameters were obtained by using the hidden Markov model for the recognition of human motion and training the human feature parameters for each of the four motions. The recognition of the four common human body movements was realized by the forward-backward algorithm. The test results show that the motion recognition method in this paper has high recognition performance and good anti-interference performance. The time-sequence pooling is used to sort the effective video frame feature sequences to obtain the feature vectors that can represent the dynamic changes of video time sequence; finally, the time-sequence feature vectors are used to train the support vector machine for classification recognition. The recognition accuracy is 65.2% and 89.4% for the HMDB51 and UCF101 datasets, respectively.

Highlights

  • Human body motion recognition is a new research hotspot in the field of machine vision and artificial intelligence, which uses computer algorithms to automatically identify human body movements from the collected videos, i.e., to classify and label the video clips of human body movements [1]

  • Situation into the form of diagrams to judge the quality of the actions, which is widely used in intelligent driver assistance systems, motion analysis in sports, physical rehabilitation and physiotherapy [3]

  • As for the high-level motion recognition and understanding, mostly the sensor is placed in the human activity space, and the human motion information is obtained by perceiving the shift of the human position in the environment, this method does not need to wear the sensor in the use process, which is convenient for the user, it is not able to accurately judge the specific action of the user

Read more

Summary

Introduction

Human body motion recognition is a new research hotspot in the field of machine vision and artificial intelligence, which uses computer algorithms to automatically identify human body movements from the collected videos, i.e., to classify and label the video clips of human body movements [1]. L. Liu et al.: Key Algorithm for Human Motion Recognition in VR Video Sequences Based on HMM situation into the form of diagrams to judge the quality of the actions, which is widely used in intelligent driver assistance systems, motion analysis in sports, physical rehabilitation and physiotherapy [3]. As for the high-level motion recognition and understanding, mostly the sensor is placed in the human activity space, and the human motion information is obtained by perceiving the shift of the human position in the environment, this method does not need to wear the sensor in the use process, which is convenient for the user, it is not able to accurately judge the specific action of the user. The non-vision-based motion recognition, simpler to implement, has a limited application scope [8]

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call