Abstract

Generative Adversarial Networks (GANs) are a thriving unsupervised machine learning technique that has led to significant advances in fields such as computer vision and natural language processing. However, GANs are notoriously difficult to train and commonly suffer from mode collapse and the discriminator winning problem. To interpret the empirical behavior of GANs and to design better ones, we deconstruct the study of GANs into three components and make the following contributions.

Formulation: we propose a perturbation view of the population target of GANs. Building on this interpretation, we connect GANs to the robust statistics framework and propose a novel GAN architecture, termed Cascade GANs, that provably recovers meaningful low-dimensional generator approximations when the real distribution is high-dimensional and corrupted by outliers.

Generalization: given a population target of GANs, we propose a systematic principle, projection under an admissible distance, for designing GANs that meet the population requirement using only finitely many samples. We instantiate the principle in three cases, achieving polynomial and sometimes near-optimal sample complexities: (1) learning an arbitrary generator under an arbitrary pseudonorm; (2) learning a Gaussian location family under total variation distance, where the principle yields a new proof of the near-optimality of the Tukey median viewed as a GAN; (3) learning a low-dimensional Gaussian approximation of an arbitrary high-dimensional distribution under Wasserstein distance. We exhibit a fundamental trade-off between approximation error and statistical error in GANs, and show how to apply the principle in practice, with only empirical samples, to predict how many samples suffice for a GAN to avoid the discriminator winning problem.

Optimization: we show that alternating gradient descent is provably not locally asymptotically stable when optimizing the GAN formulation of PCA. We find that a non-zero minimax duality gap may be one of the causes, and we propose a new GAN architecture whose duality gap is zero and whose game value equals the previous minimax value (not the maximin value). We prove that the new architecture is globally asymptotically stable in solving PCA under alternating gradient descent.
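As background (context we add here, not a claim of the abstract): the classical population objective of a GAN, from Goodfellow et al. (2014), is the two-player minimax game

```latex
\min_{G} \max_{D}\;
  \mathbb{E}_{x \sim p_{\mathrm{data}}}\!\left[\log D(x)\right]
  + \mathbb{E}_{z \sim p_{z}}\!\left[\log\!\left(1 - D(G(z))\right)\right]
```

The perturbation view above reinterprets what this population target should be when the real distribution p_data is corrupted by outliers.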
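One schematic way to read the "projection under an admissible distance" principle (the notation here is ours, for illustration only): given the empirical distribution \hat{p}_n of the samples, a generator class \mathcal{G}, and an admissible distance L, output the projection

```latex
\hat{g} \;\in\; \arg\min_{g \in \mathcal{G}} \; L\big(\hat{p}_n,\, p_g\big)
```

Cases (1)-(3) above then correspond to different choices of the class \mathcal{G} and the distance L; for instance, case (2) takes \mathcal{G} to be the Gaussian location family with L the total variation distance.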
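The optimization finding concerns the paper's GAN formulation of PCA, but the failure mode can be seen in miniature. Below is a toy sketch (our example, not the paper's game): alternating gradient descent-ascent on the scalar bilinear game min_x max_y xy, whose unique equilibrium is (0, 0), never approaches that equilibrium.

```python
# Toy illustration (not the paper's PCA game): alternating gradient
# descent-ascent on the bilinear game  min_x max_y f(x, y) = x * y.
import math

eta = 0.1        # step size; any 0 < eta < 2 shows the same behavior
x, y = 1.0, 1.0  # start away from the equilibrium (0, 0)

for t in range(10001):
    x -= eta * y  # descent step on x, holding y fixed
    y += eta * x  # ascent step on y, using the updated x
    if t % 2000 == 0:
        print(f"iter {t:5d}   distance to equilibrium = {math.hypot(x, y):.4f}")
```

Each alternating step applies the linear map (x, y) -> (x - eta*y, eta*x + (1 - eta^2)*y), whose matrix has determinant 1, so the iterates orbit the equilibrium instead of converging to it. This is the flavor of failed asymptotic stability the abstract refers to, and it motivates the zero-duality-gap architecture above.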
