A Latent Transformer for Disentangled Face Editing in Images and Videos

Xu Yao,Pierre Hellier,Alasdair Newson,Yann Gousseau

doi:10.1109/iccv48922.2021.01353

Abstract

High quality facial image editing is a challenging problem in the movie post-production industry, requiring a high degree of control and identity preservation. Previous works that attempt to tackle this problem may suffer from the entanglement of facial attributes and the loss of the person’s identity. Furthermore, many algorithms are limited to a certain task. To tackle these limitations, we propose to edit facial attributes via the latent space of a StyleGAN generator, by training a dedicated latent transformation network and incorporating explicit disentanglement and identity preservation terms in the loss function. We further introduce a pipeline to generalize our face editing to videos. Our model achieves a disentangled, controllable, and identity-preserving facial attribute editing, even in the challenging case of real (i.e., non-synthetic) images and videos. We conduct extensive experiments on image and video datasets and show that our model outperforms other state-of-the-art methods in visual quality and quantitative evaluation. Source codes are available at https://github.com/InterDigitalInc/latent-transformer.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Latent Transformer for Disentangled Face Editing in Images and Videos

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Identity and Attribute Preserving Thumbnail Upscaling
Noam Gat ... Lior Wolf
-
Noam Gat, et. al.Noam Gat ... Lior Wolf
19 Sep 2021
19 Sep 2021

Predicting Facial Attributes in Video Using Temporal Coherence and Motion-Attention
Emily M Hand ... Rama Chellappa
-
Emily M Hand, et. al.Emily M Hand ... Rama Chellappa
01 Mar 2018
01 Mar 2018

Hierarchical Color Fusion Network (HCFN): Enhancing exemplar-based video colorization
Wang Yin ... Jinbei Yu
Neurocomputing | VOL. 598
Wang Yin, et. al.Wang Yin ... Jinbei Yu
28 Jun 2024
Neurocomputing | VOL. 598

FineStyle: Semantic-Aware Fine-Grained Motion Style Transfer with Dual Interactive-Flow Fusion.
Wenfeng Song ... Shuai Li
IEEE Transactions on Visualization and Computer Graphics | VOL. PP
Wenfeng Song, et. al.Wenfeng Song ... Shuai Li
01 Nov 2023
IEEE Transactions on Visualization and Computer Graphics | VOL. PP

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Latent Transformer for Disentangled Face Editing in Images and Videos

Abstract

Talk to us

Similar Papers