Neural style-preserving visual dubbing

Hyeongwoo Kim, Michael Zollhöfer, Christian Richardt, Hans-Peter Seidel, Mohamed Elgharib, Christian Theobalt, Thabo Beeler

doi:10.1145/3355089.3356500

Abstract

Dubbing is a technique for translating video content from one language to another. However, state-of-the-art visual dubbing techniques directly copy facial expressions from source to target actors without considering identity-specific idiosyncrasies such as a unique type of smile. We present a style-preserving visual dubbing approach from single video inputs, which maintains the signature style of target actors when modifying facial expressions, including mouth motions, to match foreign languages. At the heart of our approach is the concept of motion style, in particular for facial expressions, i.e., the person-specific expression change that is yet another essential factor beyond visual accuracy in face editing applications. Our method is based on a recurrent generative adversarial network that captures the spatiotemporal co-activation of facial expressions, and enables generating and modifying the facial expressions of the target actor while preserving their style. We train our model with unsynchronized source and target videos in an unsupervised manner using cycle-consistency and mouth expression losses, and synthesize photorealistic video frames using a layered neural face renderer. Our approach generates temporally coherent results, and handles dynamic backgrounds. Our results show that our dubbing approach maintains the idiosyncratic style of the target actor better than previous approaches, even for widely differing source and target actors.
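
The cycle-consistency idea mentioned in the abstract can be illustrated with a small sketch. The abstract does not give the loss formulation, so everything here is an assumption made for illustration: expressions are represented as per-frame parameter vectors, the distance is a mean absolute (L1) difference, and the names `scale`, `l1_distance`, and `cycle_consistency_loss` are hypothetical. The learned recurrent translators are replaced by toy scaling functions, and the mouth expression loss is not modeled.

```python
from typing import Callable, List

Frame = List[float]          # assumed: one expression parameter vector per frame
ExprSequence = List[Frame]   # assumed: expression parameters over time
Translator = Callable[[ExprSequence], ExprSequence]


def scale(factor: float) -> Translator:
    """Toy stand-in for a learned style translator: scales every parameter."""
    def apply(seq: ExprSequence) -> ExprSequence:
        return [[factor * x for x in frame] for frame in seq]
    return apply


def l1_distance(a: ExprSequence, b: ExprSequence) -> float:
    """Mean absolute difference over all frames and parameters."""
    diffs = [abs(x - y) for fa, fb in zip(a, b) for x, y in zip(fa, fb)]
    return sum(diffs) / len(diffs)


def cycle_consistency_loss(
    source: ExprSequence,
    target: ExprSequence,
    s2t: Translator,
    t2s: Translator,
) -> float:
    """Penalize sequences that do not survive a round trip between styles."""
    source_cycle = t2s(s2t(source))   # source -> target style -> source style
    target_cycle = s2t(t2s(target))   # target -> source style -> target style
    return l1_distance(source, source_cycle) + l1_distance(target, target_cycle)


# Unpaired toy sequences: the two videos need not be synchronized.
source = [[1.0, -1.0]]
target = [[2.0, 0.0]]

# Translators that invert each other exactly give zero cycle loss.
exact = cycle_consistency_loss(source, target, scale(2.0), scale(0.5))

# Translators that do not invert each other are penalized.
mismatched = cycle_consistency_loss(source, target, scale(2.0), scale(0.25))
```

Because the loss only compares each sequence with its own round-trip reconstruction, it needs no frame-level correspondence between source and target, which is what makes training on unsynchronized videos possible in principle.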

Journal: ACM Transactions on Graphics
Publication Date: Nov 8, 2019
Citations: 65
License type: other-oa
