A survey on generative adversarial networks for imbalance problems in computer vision tasks

Vignesh Sampath,Iñaki Maurtua,Juan José Aguilar Martín,Aitor Gutierrez

doi:10.1186/s40537-021-00414-0

Vignesh Sampath, Iñaki Maurtua + Show 2 more

Open Access

PDF Available

https://doi.org/10.1186/s40537-021-00414-0

Copy DOI

Export

Save

Cite

Journal: Journal of Big Data	Publication Date: Jan 29, 2021
Citations: 140	License type: open-access

Affiliation: Tekniker, Universidad de Zaragoza

Abstract
Highlights/Summary
Full-Text PDF
Similar Papers

Abstract

Listen

Any computer vision application development starts off by acquiring images and data, then preprocessing and pattern recognition steps to perform a task. When the acquired images are highly imbalanced and not adequate, the desired task may not be achievable. Unfortunately, the occurrence of imbalance problems in acquired image datasets in certain complex real-world problems such as anomaly detection, emotion recognition, medical image analysis, fraud detection, metallic surface defect detection, disaster prediction, etc., are inevitable. The performance of computer vision algorithms can significantly deteriorate when the training dataset is imbalanced. In recent years, Generative Adversarial Neural Networks (GANs) have gained immense attention by researchers across a variety of application domains due to their capability to model complex real-world image data. It is particularly important that GANs can not only be used to generate synthetic images, but also its fascinating adversarial learning idea showed good potential in restoring balance in imbalanced datasets.In this paper, we examine the most recent developments of GANs based techniques for addressing imbalance problems in image data. The real-world challenges and implementations of synthetic image generation based on GANs are extensively covered in this survey. Our survey first introduces various imbalance problems in computer vision tasks and its existing solutions, and then examines key concepts such as deep generative image models and GANs. After that, we propose a taxonomy to summarize GANs based techniques for addressing imbalance problems in computer vision tasks into three major categories: 1. Image level imbalances in classification, 2. object level imbalances in object detection and 3. pixel level imbalances in segmentation tasks. We elaborate the imbalance problems of each group, and provide GANs based solutions in each group. Readers will understand how GANs based techniques can handle the problem of imbalances and boost performance of the computer vision algorithms.

Highlights

Recent developments in Convolutional Neural Networks (ConvNets) have led to substantial progress in the performance of computer vision tasks applied across various domains such as self-driving cars [1], medical imaging [2], agriculture [3, 4], Sampath et al J Big Data (2021) 8:27 manufacturing [5], etc
As opposed to other related surveys on class imbalance, that present class imbalance in tabular data, we focus on wide range of imbalance in high dimensional image data by following a systematic approach with a view to help researchers establish a detailed understanding of Generative Adversarial Neural Networks (GANs) based synthetic image generation for the imbalance problems in computer vision tasks
This paper surveys various GANs architectures that have been used for addressing the different imbalance problems in computer vision tasks

Summary

Introduction

Recent developments in Convolutional Neural Networks (ConvNets) have led to substantial progress in the performance of computer vision tasks applied across various domains such as self-driving cars [1], medical imaging [2], agriculture [3, 4], Sampath et al J Big Data (2021) 8:27 manufacturing [5], etc. In contrast to all the traditional approaches described above, Generative adversarial Neural Networks (GANs) aim to learn underlying true data distributions from the limited available images (both minority and majority class), and use the learned distributions to generate synthetic images. GANs utilize the ability of neural networks to learn a function that can approximate model distribution as close as possible to true distribution They do not rely on prior assumptions about the data distribution and can generate synthetic images with high visual fidelity. This significant property allows GANs to be applied to any kind of imbalance problem in computer vision tasks. OA-GAN is equipped with two GANs: The first GAN G1 is designed to disentangle the occlusion, and the second GAN G2 is trained to generate the occlusion free images given the generated occlusions

Findings

Discussion

Conclusion