Abstract

We present GLASS, a method for Global and Local Action-driven Sequence Synthesis. GLASS is a generative model that is trained on video sequences in an unsupervised manner and that can animate an input image at test time. The method learns to segment frames into foreground-background layers and to generate transitions of the foregrounds over time through a global and local action representation. Global actions explicitly correspond to 2D shifts, while local actions capture local (both geometric and photometric) deformations. GLASS uses a recurrent neural network to transition between frames and is trained through a reconstruction loss. We also introduce W-Sprites (Walking Sprites), a novel synthetic dataset with a predefined action space. We evaluate our method on both W-Sprites and real datasets, and find that GLASS is able to generate realistic video sequences from a single input image and to learn a richer action space than prior work. Further details, the code, and example videos are available at https://araachie.github.io/glass/.

Keywords: Video generation · Unsupervised action discovery · Controllable generation
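To make the abstract's core mechanics concrete, the sketch below illustrates one way the described pipeline could fit together: a global action as a 2D shift applied to a segmented foreground layer, alpha-compositing over the background, and a reconstruction loss against the next frame. This is a minimal illustration only, not the authors' implementation; the function names (`shift_foreground`, `composite`), tensor shapes, and the shift convention are all assumptions for this example.

```python
# Minimal sketch (not the GLASS implementation) of the abstract's core idea:
# a global action is a 2D shift applied to a segmented foreground layer,
# which is composited onto the background; training uses a reconstruction loss.
import torch
import torch.nn.functional as F


def shift_foreground(fg, alpha, shift):
    """Translate the foreground layer and its mask by a global 2D shift.

    fg:    (B, 3, H, W) foreground RGB layer
    alpha: (B, 1, H, W) soft foreground mask
    shift: (B, 2) normalized (dx, dy) in [-1, 1] -- the "global action"
    """
    B = fg.size(0)
    # Affine matrix for a pure translation; grid_sample applies the
    # inverse warp, so negate the shift to move content by (dx, dy).
    theta = torch.zeros(B, 2, 3, device=fg.device)
    theta[:, 0, 0] = 1.0
    theta[:, 1, 1] = 1.0
    theta[:, :, 2] = -shift
    grid = F.affine_grid(theta, fg.shape, align_corners=False)
    return (
        F.grid_sample(fg, grid, align_corners=False),
        F.grid_sample(alpha, grid, align_corners=False),
    )


def composite(fg, alpha, bg):
    """Alpha-composite the (shifted) foreground over the background."""
    return alpha * fg + (1.0 - alpha) * bg


# Toy usage: one reconstruction-style step with random tensors.
B, H, W = 2, 64, 64
fg, bg = torch.rand(B, 3, H, W), torch.rand(B, 3, H, W)
alpha = torch.rand(B, 1, H, W)
action = torch.tensor([[0.1, 0.0], [0.0, -0.2]])  # hypothetical global actions

fg_t, alpha_t = shift_foreground(fg, alpha, action)
pred = composite(fg_t, alpha_t, bg)
target = torch.rand(B, 3, H, W)   # placeholder for the true next frame
loss = F.mse_loss(pred, target)   # reconstruction loss
print(loss.item())
```

In the paper's setting, the shift and the local deformations would be predicted per frame by a recurrent network rather than supplied by hand; local (geometric and photometric) actions are omitted here for brevity.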
