Abstract

The use of computer vision for target detection and recognition has been an interesting and challenging area of research for the past three decades. Professional athletes and sports enthusiasts in general can be trained with appropriate systems for corrective training and assistive training. Such a need has motivated researchers to combine artificial intelligence with the field of sports to conduct research. In this paper, we propose a Mask Region-Convolutional Neural Network (MR-CNN)- based method for yoga movement recognition based on the image task of yoga movement recognition. The improved MR-CNN model is based on the framework and structure of the region-convolutional network, which proposes a certain number of candidate regions for the image by feature extraction and classifies them, then outputs these regions as detected bounding boxes, and does mask prediction for the candidate regions using segmentation branches. The improved MR-CNN model uses an improved deep residual network as the backbone network for feature extraction, bilinear interpolation of the extracted candidate regions using Region of Interest (RoI) Align, followed by target classification and detection, and segmentation of the image using the segmentation branch. The model improves the convolution part in the segmentation branch by replacing the original standard convolution with a depth-separable convolution to improve the network efficiency. Experimentally constructed polygon-labeled datasets are simulated using the algorithm. The deepening of the network and the use of depth-separable network improve the accuracy of detection while maintaining the reliability of the network and validate the effectiveness of the improved MR-CNN.

Highlights

  • Hui WangReceived 24 July 2021; Revised 19 August 2021; Accepted 26 August 2021; Published 30 October 2021

  • Gigabytes of images are generated every day in the Internet, which contain a huge amount of information

  • Image retrieval has become very active in related research areas since the 1970s. e advancement of image retrieval is inseparable from the development of database management systems and the effective promotion of computer vision as a field

Read more

Summary

Hui Wang

Received 24 July 2021; Revised 19 August 2021; Accepted 26 August 2021; Published 30 October 2021. Professional athletes and sports enthusiasts in general can be trained with appropriate systems for corrective training and assistive training. E improved MR-CNN model is based on the framework and structure of the region-convolutional network, which proposes a certain number of candidate regions for the image by feature extraction and classifies them, outputs these regions as detected bounding boxes, and does mask prediction for the candidate regions using segmentation branches. E improved MR-CNN model uses an improved deep residual network as the backbone network for feature extraction, bilinear interpolation of the extracted candidate regions using Region of Interest (RoI) Align, followed by target classification and detection, and segmentation of the image using the segmentation branch. Constructed polygon-labeled datasets are simulated using the algorithm. e deepening of the network and the use of depth-separable network improve the accuracy of detection while maintaining the reliability of the network and validate the effectiveness of the improved MR-CNN

Introduction
Related Works
Sliding window
Bounding box Segmentation result
Number of anchors Average error
Number of iterations
Recall rate
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call