Controllable Image-to-Video Translation: A Case Study on Facial Expression Generation

Lijie Fan,Wenbing Huang,Junzhou Huang,Chuang Gan,Boqing Gong

doi:10.1609/aaai.v33i01.33013510

Abstract

The recent advances in deep learning have made it possible to generate photo-realistic images by using neural networks and even to extrapolate video frames from an input video clip. In this paper, for the sake of both furthering this exploration and our own interest in a realistic application, we study imageto-video translation and particularly focus on the videos of facial expressions. This problem challenges the deep neural networks by another temporal dimension comparing to the image-to-image translation. Moreover, its single input image fails most existing video generation methods that rely on recurrent models. We propose a user-controllable approach so as to generate video clips of various lengths from a single face image. The lengths and types of the expressions are controlled by users. To this end, we design a novel neural network architecture that can incorporate the user input into its skip connections and propose several improvements to the adversarial training method for the neural network. Experiments and user studies verify the effectiveness of our approach. Especially, we would like to highlight that even for the face images in the wild (downloaded from the Web and the authors’ own photos), our model can generate high-quality facial expression videos of which about 50% are labeled as real by Amazon Mechanical Turk workers.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Controllable Image-to-Video Translation: A Case Study on Facial Expression Generation

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence	Publication Date: Jul 17, 2019
Citations: 28

Similar Papers

Image-to-Video Generation via 3D Facial Dynamics
Xiaoguang Tu ... Zhifeng Li
IEEE transactions on circuits and systems for video technology : a publication of the Circuits and Systems Society | VOL. 32
Xiaoguang Tu, et. al.Xiaoguang Tu ... Zhifeng Li
27 May 2021
IEEE transactions on circuits and systems for video technology : a publication of the Circuits and Systems Society | VOL. 32

Deep Bayesian active learning with image data
...
-
, et. al. ...
27 Nov 2017
27 Nov 2017

3-Dimensional Face from a Single Face Image with Various Expressions
Yu-Jin Hong ... Junghyun Cho
-
Yu-Jin Hong, et. al.Yu-Jin Hong ... Junghyun Cho
01 Jan 2015
01 Jan 2015

Quantitative Assessment of Perceived Visibility Enhancement with Image Processing for Single Face Images: A Preliminary Study
Ming Mei ... Susan J Leat
Investigative Opthalmology & Visual Science | VOL. 50
Ming Mei, et. al.Ming Mei ... Susan J Leat
15 Apr 2009
Investigative Opthalmology & Visual Science | VOL. 50

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Controllable Image-to-Video Translation: A Case Study on Facial Expression Generation

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence