Text to image synthesis using multi-generator text conditioned generative adversarial networks

Min Zhang, Chunye Li, Zhiping Zhou

doi:10.1007/s11042-020-09965-5

Abstract

Generative Adversarial Networks (GANs) have recently become the mainstream approach to text-to-image synthesis. However, vanilla deep neural networks tend to approximate continuous mappings in real generation tasks rather than discontinuous mappings with discrete points. When trained on datasets that contain multiple types of images, a GAN can fail to synthesize diverse images, a failure known as mode collapse. To address this, we propose the Multi-generator Text Conditioned Generative Adversarial Network (MTC-GAN). The textual description of a real image is embedded into the noise vector as a constraint. Building on Deep Convolutional Generative Adversarial Networks (DCGAN), multiple generators are incorporated to capture the high-probability regions of the target distribution. Because the discriminator must identify which generator produced a given fake sample, it forces the generators to cover different, identifiable modes. This method, based on global constraints, makes the generated images more diverse. The multiple generators also indirectly improve the functional shape of the discriminator, which should make the GAN more stable when trained in high-dimensional spaces. Experimental results on the standard dataset show that the proposed method performs well: mode collapse is alleviated and the generated samples are more diverse.
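
The multi-generator idea described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. Several details are assumptions made only for illustration: the text embedding is concatenated with the noise vector, each generator is a single linear layer with `tanh` (a stand-in for a DCGAN generator), and the discriminator has K + 1 outputs, one for "real" and one per generator. All names and sizes (`NOISE_DIM`, `TEXT_DIM`, `IMG_DIM`, `K`, `generate`, `discriminator_logits`) are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes only; none of these values come from the paper.
NOISE_DIM, TEXT_DIM, IMG_DIM, K = 16, 8, 32, 3


def conditioned_input(z, text_emb):
    """Assumed conditioning: concatenate noise and text embedding."""
    return np.concatenate([z, text_emb], axis=1)


# K toy generators: one linear map each (stand-ins for DCGAN generators).
gen_weights = [rng.normal(0, 0.1, (NOISE_DIM + TEXT_DIM, IMG_DIM)) for _ in range(K)]

# Toy discriminator with K + 1 outputs: class 0 = real, class k = generator k.
disc_weights = rng.normal(0, 0.1, (IMG_DIM + TEXT_DIM, K + 1))


def generate(k, z, text_emb):
    """Produce a flattened toy 'image' from generator k."""
    return np.tanh(conditioned_input(z, text_emb) @ gen_weights[k])


def discriminator_logits(img, text_emb):
    """Score an image/text pair against the K + 1 classes."""
    return np.concatenate([img, text_emb], axis=1) @ disc_weights


def cross_entropy(logits, labels):
    """Mean softmax cross-entropy, computed in a numerically stable way."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return -log_probs[np.arange(len(labels)), labels].mean()


batch = 4
text_emb = rng.normal(size=(batch, TEXT_DIM))
z = rng.normal(size=(batch, NOISE_DIM))

# Every generator sees the same conditioned input but has its own weights.
fakes = [generate(k, z, text_emb) for k in range(K)]

# Discriminator loss on fakes: each sample is labelled with the index of the
# generator that produced it, so generators are pushed toward distinct modes.
d_loss = sum(
    cross_entropy(discriminator_logits(f, text_emb), np.full(batch, k + 1))
    for k, f in enumerate(fakes)
) / K
```

In a full training loop the discriminator would also be trained on real images with label 0, and each generator would be updated to make its samples look real. Both steps are omitted here to keep the sketch short.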

Journal: Multimedia Tools and Applications
Publication Date: Oct 30, 2020
Citations: 11

Similar Papers

Study of Prevention of Mode Collapse in Generative Adversarial Network (GAN)
Bhagyashree ... G C Nandi
03 Dec 2020

Data augmentation-based enhanced fingerprint recognition using deep convolutional generative adversarial network and diffusion models
Yukai Liu
Applied and Computational Engineering | VOL. 52
27 Mar 2024

A variability aware GAN for improving spatial representativeness of discrete geobodies
Roozbeh Koochak ... Manouchehr Haghighi
Computers & Geosciences | VOL. 166
14 Jul 2022

Research and Application Analysis of Correlative Optimization Algorithms for GAN
Tianmeng Wang
Highlights in Science, Engineering and Technology | VOL. 57
11 Jul 2023
