Abstract
We study video inpainting, which aims to recover realistic textures in damaged frames. Recent progress has been made by using other frames as references so that relevant textures can be transferred to the damaged frames. However, existing video inpainting approaches pay little attention to the model's ability to extract information and reconstruct content, and therefore often fail to transfer the relevant textures accurately. In this paper, we propose a novel and effective spatial-temporal texture transformer network (STTTN) for video inpainting. STTTN consists of six closely related modules optimized for the video inpainting task: a feature similarity measure for more accurate frame pre-repair, an encoder with strong information-extraction ability, an embedding module that computes correlations between the input and reference frames, a coarse low-frequency feature transfer, a refined high-frequency feature transfer, and a decoder with accurate content-reconstruction ability. This design encourages joint feature learning across the input and reference frames. To demonstrate the effectiveness and superiority of the proposed model, we conduct comprehensive ablation studies and qualitative and quantitative experiments on multiple datasets, using both standard stationary masks and more realistic moving-object masks. The experimental results demonstrate the reliability and strong performance of STTTN.
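To make the six-stage pipeline named above concrete, the following is a minimal sketch of how such a network could be wired together. The abstract does not specify layer configurations, so everything here (module depths, channel counts, the cosine-similarity embedding, the PyTorch framing) is an illustrative assumption, not the authors' implementation.

```python
# Hypothetical STTTN-style pipeline sketch (assumed architecture, not the paper's code):
# pre-repaired input -> encoder -> similarity embedding -> coarse (low-frequency) transfer
# -> refined (high-frequency) transfer -> decoder.
import torch
import torch.nn as nn
import torch.nn.functional as F


class STTTNSketch(nn.Module):
    def __init__(self, channels: int = 64):
        super().__init__()
        # Encoder: extracts features from both the damaged and reference frames.
        self.encoder = nn.Sequential(
            nn.Conv2d(3, channels, 3, padding=1), nn.ReLU(),
            nn.Conv2d(channels, channels, 3, padding=1), nn.ReLU(),
        )
        # Coarse and refined transfer stages fuse transferred reference features.
        self.coarse_transfer = nn.Conv2d(2 * channels, channels, 3, padding=1)
        self.refine_transfer = nn.Conv2d(2 * channels, channels, 3, padding=1)
        # Decoder: reconstructs the inpainted frame content.
        self.decoder = nn.Sequential(
            nn.Conv2d(channels, channels, 3, padding=1), nn.ReLU(),
            nn.Conv2d(channels, 3, 3, padding=1),
        )

    def embed_similarity(self, damaged_feat, ref_feat):
        # Embedding module (assumed): cosine similarity between flattened feature maps,
        # used to softly attend to reference features relevant to the damaged frame.
        b, c, h, w = damaged_feat.shape
        q = F.normalize(damaged_feat.flatten(2), dim=1)       # (B, C, HW)
        k = F.normalize(ref_feat.flatten(2), dim=1)           # (B, C, HW)
        attn = torch.softmax(q.transpose(1, 2) @ k, dim=-1)   # (B, HW, HW)
        transferred = (ref_feat.flatten(2) @ attn.transpose(1, 2)).view(b, c, h, w)
        return transferred

    def forward(self, damaged_frame, reference_frame):
        f_dam = self.encoder(damaged_frame)
        f_ref = self.encoder(reference_frame)
        f_trans = self.embed_similarity(f_dam, f_ref)
        coarse = self.coarse_transfer(torch.cat([f_dam, f_trans], dim=1))
        refined = self.refine_transfer(torch.cat([coarse, f_trans], dim=1))
        return self.decoder(refined)


if __name__ == "__main__":
    net = STTTNSketch()
    out = net(torch.rand(1, 3, 64, 64), torch.rand(1, 3, 64, 64))
    print(out.shape)  # torch.Size([1, 3, 64, 64])
```

In this sketch the same encoder is shared by the damaged and reference frames, which is one simple way to realize the "joint feature learning across the input and reference frames" mentioned in the abstract; the actual frame pre-repair step and the split between low- and high-frequency transfer are not detailed here.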