A Survey on GAN-Based Data Augmentation for Hand Pose Estimation Problem

Farnaz Farahanipad,Mohammad Sadegh Nasr,Vassilis Athitsos,Mohammad Rezaei,Farhad Kamangar

doi:10.3390/technologies10020043

Farnaz Farahanipad, Mohammad Sadegh Nasr + Show 3 more

Open Access

https://doi.org/10.3390/technologies10020043

Copy DOI

Journal: Technologies	Publication Date: Mar 21, 2022
Citations: 9	License type: CC BY 4.0

Affiliation: The University of Texas at Arlington

Abstract

Deep learning solutions for hand pose estimation are now very reliant on comprehensive datasets covering diverse camera perspectives, lighting conditions, shapes, and pose variations. While acquiring such datasets is a challenging task, several studies circumvent this problem by exploiting synthetic data, but this does not guarantee that they will work well in real situations mainly due to the gap between the distribution of synthetic and real data. One recent popular solution to the domain shift problem is learning the mapping function between different domains through generative adversarial networks. In this study, we present a comprehensive study on effective hand pose estimation approaches, which are comprised of the leveraged generative adversarial network (GAN), providing a comprehensive training dataset with different modalities. Benefiting from GAN, these algorithms can augment data to a variety of hand shapes and poses where data manipulation is intuitively controlled and greatly realistic. Next, we present related hand pose datasets and performance comparison of some of these methods for the hand pose estimation problem. The quantitative and qualitative results indicate that the state-of-the-art hand pose estimators can be greatly improved with the aid of the training data generated by these GAN-based data augmentation methods. These methods are able to beat the baseline approaches with better visual quality and higher values in most of the metrics (PCK and ME) on both the STB and NYU datasets. Finally, in conclusion, the limitation of the current methods and future directions are discussed.

Full Text