LIA: Latent Image Animator.

Yaohui Wang,Francois Bremond,Antitza Dantcheva,Di Yang

doi:10.1109/tpami.2024.3449075

Abstract

Previous animation techniques mainly focus on leveraging explicit structure representations (e.g., meshes or keypoints) for transferring motion from driving videos to source images. However, such methods are challenged with large appearance variations between source and driving data, as well as require complex additional modules to respectively model appearance and motion. Towards addressing these issues, we introduce the Latent Image Animator (LIA), streamlined to animate high-resolution images. LIA is designed as a simple autoencoder that does not rely on explicit representations. Motion transfer in the pixel space is modeled as linear navigation of motion codes in the latent space. Specifically such navigation is represented as an orthogonal motion dictionary learned in a self-supervised manner based on proposed Linear Motion Decomposition (LMD). Extensive experimental results demonstrate that LIA outperforms state-of-the-art on VoxCeleb, TaichiHD, and TED-talk datasets with respect to video quality and spatio-temporal consistency. In addition LIA is well equipped for zero-shot high-resolution image animation. Code, models, and demo video are available at https://github.com/wyhsirius/LIA.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

LIA: Latent Image Animator.

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on pattern analysis and machine intelligence

Lead the way for us

Journal: IEEE transactions on pattern analysis and machine intelligence	Publication Date: Dec 1, 2024
Citations: 3

Similar Papers

Discriminative learning based visual servoing across object instances
Harit Pandya ... K Madhava Krishna
-
Harit Pandya, et. al.Harit Pandya ... K Madhava Krishna
01 May 2016
01 May 2016

Rotation-Invariant Face Detection with Multi-task Progressive Calibration Networks
Li-Fang Zhou ... Yu Gu
-
Li-Fang Zhou, et. al.Li-Fang Zhou ... Yu Gu
01 Jan 2020
01 Jan 2020

Visual Object Tracking Based on Backward Model Validation
Yuan Yuan ... Weisi Lin
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 24
Yuan Yuan, et. al. Yuan Yuan ... Weisi Lin
01 Nov 2014
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 24

Joint Expression Synthesis and Representation Learning for Facial Expression Recognition
Xi Zhang ... Feifei Zhang
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 32
Xi Zhang, et. al.Xi Zhang ... Feifei Zhang
02 Feb 2021
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 32

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

LIA: Latent Image Animator.

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on pattern analysis and machine intelligence