Text-to-image Synthesis via Symmetrical Distillation Networks

Mingkuan Yuan,Yuxin Peng

doi:10.1145/3240508.3240559

Abstract

Text-to-image synthesis aims to automatically generate images according to text descriptions given by users, which is a highly challenging task. The main issues of text-to-image synthesis lie in two gaps: the heterogeneous and homogeneous gaps. The heterogeneous gap is between the high-level concepts of text descriptions and the pixel-level contents of images, while the homogeneous gap exists between synthetic image distributions and real image distributions. For addressing these problems, we exploit the excellent capability of generic discriminative models (e.g. VGG19), which can guide the training process of a new generative model on multiple levels to bridge the two gaps. The high-level representations can teach the generative model to extract necessary visual information from text descriptions, which can bridge the heterogeneous gap. The mid-level and low-level representations can lead it to learn structures and details of images respectively, which relieves the homogeneous gap. Therefore, we propose Symmetrical Distillation Networks (SDN) composed of a source discriminative model as "teacher" and a target generative model as "student". The target generative model has a symmetrical structure with the source discriminative model, in order to transfer hierarchical knowledge accessibly. Moreover, we decompose the training process into two stages with different distillation paradigms for promoting the performance of the target generative model. Experiments on two widely-used datasets are conducted to verify the effectiveness of our proposed SDN.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Text-to-image Synthesis via Symmetrical Distillation Networks

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

DeepCollaboration: Collaborative Generative and Discriminative Models for Class Incremental Learning
Bo Cui ... Shan Yu
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 35
Bo Cui, et. al.Bo Cui ... Shan Yu
18 May 2021
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 35

Combining deep generative and discriminative models for Bayesian semi-supervised learning
Jonathan Gordon ... José Miguel Hernández-Lobato
Pattern Recognition | VOL. 100
Jonathan Gordon, et. al.Jonathan Gordon ... José Miguel Hernández-Lobato
14 Dec 2019
Pattern Recognition | VOL. 100

Research on Text to Image Based on Generative Adversarial Network
Li Xiaolin ... Gao Yuwei
-
Li Xiaolin, et. al.Li Xiaolin ... Gao Yuwei
01 Dec 2020
01 Dec 2020

Discriminative models for semi-supervised natural language learning
Sajib Dasgupta ... Vincent Ng
-
Sajib Dasgupta, et. al.Sajib Dasgupta ... Vincent Ng
01 Jan 2009
01 Jan 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Text-to-image Synthesis via Symmetrical Distillation Networks

Abstract

Talk to us

Similar Papers