Abstract

Transformer-based video inpainting methods aggregate coherent content into missing regions by learning spatial–temporal dependencies. However, existing methods suffer from inaccurate self-attention calculation and excessive quadratic computational complexity, caused respectively by uninformative representations of missing regions and inefficient global self-attention mechanisms. To mitigate these problems, we propose a Feature pre-Inpainting enhanced Transformer (FITer) video inpainting method, which comprises a feature pre-inpainting network (FPNet) and a local–global interleaving Transformer. The FPNet pre-inpaints missing features before the Transformer by exploiting spatial context, so the representations of missing regions are enhanced with more informative content. The interleaving Transformer can therefore compute more accurate self-attention weights and learn more effective dependencies between missing and valid regions. Since the interleaving Transformer combines global and window-based local self-attention mechanisms, the proposed FITer method effectively aggregates spatial–temporal features into missing regions while improving efficiency. Experiments on the YouTube-VOS and DAVIS datasets demonstrate that FITer outperforms previous methods both qualitatively and quantitatively.
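To illustrate the idea of interleaving window-based local self-attention with global self-attention, the sketch below alternates the two block types over a spatial–temporal feature map. This is a minimal, illustrative reconstruction assuming a simplified formulation in PyTorch; the class names, window size, and layer arrangement are assumptions for exposition and do not reproduce the authors' FITer implementation or the FPNet pre-inpainting stage.

```python
import torch
import torch.nn as nn

class WindowSelfAttention(nn.Module):
    """Local self-attention within non-overlapping spatial windows (illustrative)."""
    def __init__(self, dim, window_size=8, num_heads=4):
        super().__init__()
        self.ws = window_size
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)

    def forward(self, x):
        # x: (B, T, H, W, C) spatial-temporal feature map
        B, T, H, W, C = x.shape
        ws = self.ws
        # Partition each frame into ws x ws windows and attend within each window.
        x = x.reshape(B, T, H // ws, ws, W // ws, ws, C)
        x = x.permute(0, 1, 2, 4, 3, 5, 6).reshape(-1, ws * ws, C)
        out, _ = self.attn(x, x, x)
        out = out.reshape(B, T, H // ws, W // ws, ws, ws, C)
        return out.permute(0, 1, 2, 4, 3, 5, 6).reshape(B, T, H, W, C)

class GlobalSelfAttention(nn.Module):
    """Global self-attention across all tokens of all frames (illustrative)."""
    def __init__(self, dim, num_heads=4):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)

    def forward(self, x):
        B, T, H, W, C = x.shape
        tokens = x.reshape(B, T * H * W, C)
        out, _ = self.attn(tokens, tokens, tokens)
        return out.reshape(B, T, H, W, C)

class InterleavedAttentionStack(nn.Module):
    """Alternates window-based local and global attention blocks with residuals."""
    def __init__(self, dim, depth=4, window_size=8, num_heads=4):
        super().__init__()
        self.blocks = nn.ModuleList(
            WindowSelfAttention(dim, window_size, num_heads) if i % 2 == 0
            else GlobalSelfAttention(dim, num_heads)
            for i in range(depth)
        )
        self.norms = nn.ModuleList(nn.LayerNorm(dim) for _ in range(depth))

    def forward(self, x):
        for norm, block in zip(self.norms, self.blocks):
            x = x + block(norm(x))  # pre-norm residual connection
        return x

# Example: 2 frames of 32x32 features with 64 channels (hypothetical sizes).
feats = torch.randn(1, 2, 32, 32, 64)
stack = InterleavedAttentionStack(dim=64, depth=4, window_size=8, num_heads=4)
print(stack(feats).shape)  # torch.Size([1, 2, 32, 32, 64])
```

Restricting half of the layers to ws x ws windows keeps their attention cost linear in the number of windows rather than quadratic in the full token count, which is the efficiency motivation the abstract attributes to the local–global design.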
