Abstract

With the rapid development of convolutional neural networks (CNNs), the performance of human pose estimation has been significantly improved. However, state-of-the-art methods still face specific challenges, such as occluded keypoints and nearby persons. Unlike the classical network improvement strategy, our approach obtains adaptive data to obtain more accurate keypoints through data augmentation. In this paper, we propose Adaptive Data Augmentation Network (ADA-Net), which brings more adaptive training data by adding occlusion and interleaving information to the original image. First, we introduce Active Transmission Network (ATNet), which actively learns the transformation matrix from the original image and uses the matrix to synthesize new images for training. Second, we adopt an adversarial training strategy combined with the ATNet that allows us to capture more challenging cases. Extensive experiments show that our approach achieves comparable or even better results than most state-of-the-art methods. In particular, our ADA-Net outperforms High-Resolution Network (HRNet) by 1.1 and 0.7 points on the COCO test-dev set and MPII validation set, respectively.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call