Animating landscape

Yuki Endo, Shigeru Kuriyama, Yoshihiro Kanamori

doi:10.1145/3355089.3356523

Abstract

Automatic generation of a high-quality video from a single image remains a challenging task despite recent advances in deep generative models. This paper proposes a method that uses convolutional neural networks (CNNs) to create a high-resolution, long-term animation from a single landscape image, focusing mainly on skies and water. Our key observation is that motion (e.g., moving clouds) and appearance (e.g., time-varying colors in the sky) in natural scenes have different time scales. We thus learn them separately and predict them with decoupled control, handling future uncertainty in both predictions by introducing latent codes. Unlike previous methods that infer output frames directly, our CNNs predict spatially smooth intermediate data (flow fields for warping in the case of motion, and color transfer maps in the case of appearance) via self-supervised learning, i.e., without explicitly provided ground truth. These intermediate data are applied not to each previous output frame, but to the input image only once for each output frame. This design is crucial for alleviating error accumulation in long-term predictions, which is the essential problem in previous recurrent approaches. The output frames can be looped like a cinemagraph, and can also be controlled directly by specifying latent codes or indirectly via visual annotations. We demonstrate the effectiveness of our method through comparisons with state-of-the-art methods for video prediction as well as appearance manipulation. The resulting videos, code, and datasets will be available at http://www.cgg.cs.tsukuba.ac.jp/~endo/projects/AnimatingLandscape.
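
The idea of applying flow fields to the input image once per output frame, rather than to each previous output frame, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `warp` and `animate_from_input`, the backward-flow convention, and the bilinear sampling with border clamping are all assumptions made for illustration, and the flow fields that the paper's CNNs would predict are simply passed in as arrays.

```python
import numpy as np


def warp(field, flow):
    """Backward-warp `field` (H, W, C) by `flow` (H, W, 2).

    Assumed convention: out[y, x] = field[y + flow[y, x, 1], x + flow[y, x, 0]],
    sampled bilinearly, with coordinates clamped to the image border.
    """
    h, w = field.shape[:2]
    ys, xs = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")
    sx = np.clip(xs + flow[..., 0], 0, w - 1)
    sy = np.clip(ys + flow[..., 1], 0, h - 1)
    x0 = np.floor(sx).astype(int)
    y0 = np.floor(sy).astype(int)
    x1 = np.minimum(x0 + 1, w - 1)
    y1 = np.minimum(y0 + 1, h - 1)
    wx = (sx - x0)[..., None]
    wy = (sy - y0)[..., None]
    top = field[y0, x0] * (1 - wx) + field[y0, x1] * wx
    bottom = field[y1, x0] * (1 - wx) + field[y1, x1] * wx
    return top * (1 - wy) + bottom * wy


def animate_from_input(image, step_flows):
    """Produce one frame per step flow, always sampling from `image`.

    Instead of warping the previous output frame, the per-step flows are
    composed into a single flow from the current frame back to the input:
    total_t(x) = f_t(x) + total_{t-1}(x + f_t(x)).
    Only the smooth flow field is resampled repeatedly; the image itself
    is resampled exactly once per output frame.
    """
    total = np.zeros(image.shape[:2] + (2,))
    frames = []
    for step in step_flows:
        total = step + warp(total, step)
        frames.append(warp(image, total))
    return frames


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    image = rng.random((4, 6, 3))
    # Hypothetical stand-in for predicted motion: 1 pixel per step along x.
    shift = np.zeros((4, 6, 2))
    shift[..., 0] = 1.0
    frames = animate_from_input(image, [shift, shift])
    print(np.allclose(frames[1][:, :4], image[:, 2:]))
```

In a recurrent scheme each frame would be resampled from the previous one, so interpolation blur compounds over time; here any accumulated error stays in the flow field, which the abstract describes as spatially smooth.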

Journal: ACM Transactions on Graphics
Publication Date: Nov 8, 2019
Citations: 33

