Abstract

This paper proposes Attribute-Decomposed GAN (ADGAN) and its enhanced version (ADGAN++) for controllable image synthesis, which can produce realistic images with desired attributes provided by various source inputs. The core idea of both ADGAN and ADGAN++ is to embed component attributes into the latent space as independent codes, thus achieving flexible and continuous control of attributes via mixing and interpolation operations on explicit style representations. The major difference between them is that ADGAN processes all component attributes simultaneously, whereas ADGAN++ adopts a serial encoding strategy. More specifically, ADGAN consists of two encoding pathways with style block connections and is capable of decomposing the original hard mapping into multiple more accessible subtasks. In the source pathway, component layouts are extracted via a semantic parser, and the segmented components are fed into a shared global texture encoder to obtain decomposed latent codes. This strategy allows for the synthesis of more realistic output images and the automatic separation of un-annotated component attributes. Although the original ADGAN works in an elegant and efficient manner, it intrinsically fails to handle the semantic image synthesis task when the number of attribute categories is large. To address this problem, ADGAN++ employs serial encoding of different component attributes to synthesize each part of the target real-world image, and adopts several residual blocks with segmentation-guided instance normalization to assemble the synthesized component images and refine the original synthesis result. The two-stage ADGAN++ is designed to alleviate the massive computational costs of synthesizing real-world images with numerous attributes while maintaining the disentanglement of different attributes, enabling flexible control of arbitrary component attributes of the synthesized images. Experimental results demonstrate the proposed methods' superiority over the state of the art in pose transfer, face style transfer, and semantic image synthesis, as well as their effectiveness in the task of component attribute transfer. Our code and data are publicly available at https://github.com/menyifang/ADGAN.
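To make the decomposed-code idea above concrete, the PyTorch-style sketch below masks each semantic component of the source image and encodes it with a single shared texture encoder, producing one independent style code per component that can then be mixed or interpolated. The class name, layer sizes, code dimension, and mask format are illustrative assumptions rather than the paper's exact architecture.

import torch
import torch.nn as nn

class SharedTextureEncoder(nn.Module):
    """Minimal sketch of attribute-decomposed encoding: every semantic
    component of the source image is masked out and passed through one
    SHARED texture encoder, giving an independent style code per component.
    Layer sizes, code dimension, and mask format are illustrative
    assumptions, not the paper's exact architecture."""

    def __init__(self, code_dim=256):
        super().__init__()
        # Simplified shared global texture encoder (conv stack + global pooling).
        self.encoder = nn.Sequential(
            nn.Conv2d(3, 64, 4, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(64, 128, 4, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(128, code_dim, 4, stride=2, padding=1),
            nn.AdaptiveAvgPool2d(1),
        )

    def forward(self, source_img, component_masks):
        # source_img:      (B, 3, H, W) source image
        # component_masks: (B, K, H, W) binary masks from a semantic parser
        codes = []
        for k in range(component_masks.shape[1]):
            masked = source_img * component_masks[:, k:k + 1]  # isolate one component
            codes.append(self.encoder(masked).flatten(1))      # (B, code_dim)
        return torch.stack(codes, dim=1)                       # (B, K, code_dim)


# Because the codes are independent, attributes can be controlled per component:
# codes_a = enc(img_a, masks_a); codes_b = enc(img_b, masks_b)
# mixed = codes_a.clone(); mixed[:, k] = codes_b[:, k]   # swap one component attribute
# interp = (1 - t) * codes_a + t * codes_b               # continuous interpolation

The key design choice this sketch illustrates is weight sharing: because all components pass through the same encoder, their codes live in a common style space, which is what makes cross-image mixing and interpolation meaningful.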
