Abstract

Neural networks have improved the accuracy of human pose estimation from a single RGB image. However, such estimation remains difficult, especially when the human body is only partially visible due to a limited field of view of the camera or occlusions. In this paper, we introduce a data augmentation method called body-cropping augmentation (BCA), which generalizes the dataset for effective training in human pose estimation. This technique includes the policies of data generation and the training strategy using the augmented data. The experiments with the COCO val2017 dataset with ground-truth bounding boxes show BCA consistently enhances accuracies of state-of-the-art neural networks by an average of 1.08% without any modification to the network architecture. Moreover, the proposed BCA technique effectively reduces the false negatives of localizing keypoints, especially in an input image with a few visible keypoints.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call