Improved First-Order Motion Model of Image Animation with Enhanced Dense Motion and Repair Ability

Yu Xu,Jianwen Chen,Qiang Liu,Feng Xu

doi:10.3390/app13074137

Abstract

Image animation aims to transfer the posture change of a driving video to the static object of the source image, and has potential applications in various domains, such as film and game industries. The essential part in this task is to generate a video by learning the motion from the driving video while preserving the appearance from the source image. As a result, a new object with the same motion will be generated in the animated video. However, it is a significant challenge if the object pose shows large-scale change. Even the most recent method failed to achieve this correctly with good visual effects. In order to solve the problem of poor visual effects in the videos with the large-scale pose change, a novel method based on an improved first-order motion model (FOMM) with enhanced dense motion and repair ability was proposed in this paper. Firstly, when generating optical flow, we propose an attention mechanism that optimizes the feature representation of the image in both channel and spatial domains through maximum pooling. This enables better distortion of the source image into the feature domain of the driving image. Secondly, we further propose a multi-scale occlusion restoration module that generates a multi-resolution occlusion map by upsampling the low-resolution occlusion map. Following this, the generator redraws the occluded part of the reconstruction result across multiple scales through the multi-resolution occlusion map to achieve more accurate and vivid visual effects. In addition, the proposed model can be trained effectively in an unsupervised manner. We evaluated the proposed model on three benchmark datasets. The experimental results showed that multiple evaluation indicators were improved by our proposed method, and the visual effect of the animated videos obviously outperformed the FOMM. On the Voxceleb1 dataset, the pixel error, average keypoints distance and average Euclidean distance by our proposed method were reduced by 6.5%, 5.1% and 0.7%, respectively. On the TaiChiHD dataset, the pixel error, average keypoints distance and missing keypoints rate measured by our proposed method were reduced by 4.9%, 13.5% and 25.8%, respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied Sciences	Publication Date: Mar 24, 2023
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Improved First-Order Motion Model of Image Animation with Enhanced Dense Motion and Repair Ability

Abstract

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

Adaptive Ensemble Method Based on Spatial Characteristics for Classifying Imbalanced Data
Lei Wang ... Guan Gui
Scientific Programming | VOL. 2017
Lei Wang, et. al.Lei Wang ... Guan Gui
01 Jan 2017
Scientific Programming | VOL. 2017

Analysis of Eevee Engine Rendering Engineering in Making 3D Animation Videos Mukomuko Hospital
Eka Sahputra ... Muhardi Hari Sucahyo
Jurnal Komputer, Informasi dan Teknologi | VOL. 2
Eka Sahputra, et. al.Eka Sahputra ... Muhardi Hari Sucahyo
10 Dec 2022
Jurnal Komputer, Informasi dan Teknologi | VOL. 2

Exemplar-based human facial features cloning
Damon Shing-Min Liu ... Feng-Yi Lin
-
Damon Shing-Min Liu, et. al.Damon Shing-Min Liu ... Feng-Yi Lin
01 May 2017
01 May 2017

Decision tree algorithm based on average Euclidean distance
Quan Liu ... Qicui Yan
-
Quan Liu, et. al.Quan Liu ... Qicui Yan
01 Jan 2009
01 Jan 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improved First-Order Motion Model of Image Animation with Enhanced Dense Motion and Repair Ability

Abstract

Talk to us

Similar Papers

More From: Applied Sciences