Abstract

Video inpainting is the task of synthesizing spatio-temporally coherent content in the missing regions of a video sequence, and it has recently drawn increasing attention. To exploit temporal information across frames, most recent deep learning-based methods first align reference frames to the target frame using explicit or implicit motion estimation and then integrate information from the aligned frames. Their performance, however, relies heavily on the accuracy of frame-to-frame alignment. To alleviate this problem, this paper proposes a novel Temporal Group Fusion Network (TGF-Net) that effectively integrates temporal information through a two-stage fusion strategy. Specifically, the input frames are reorganized into different groups, and each group is followed by an intra-group fusion module that integrates the information within that group. Different groups provide complementary information for the missing region, and a temporal attention model is further designed to adaptively integrate information across groups. This fusion strategy removes the dependence on alignment operations, greatly improving the visual quality and temporal consistency of the inpainted results. In addition, a coarse alignment module is introduced at the beginning of the network to handle videos with large motion. Extensive experiments on the DAVIS and YouTube-VOS datasets demonstrate the superiority of the proposed method in terms of PSNR/SSIM, visual quality, and temporal consistency.
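
The abstract gives no implementation details, but the grouped two-stage fusion idea can be illustrated with a minimal sketch. The module names (`IntraGroupFusion`, `TemporalGroupAttention`), the group size, the 3D-convolution design, and the channel widths below are illustrative assumptions rather than the authors' architecture; the sketch only shows the overall flow: frames are split into groups, each group is fused into a single feature map, and a softmax attention over the group axis merges the complementary group features without any frame-to-frame alignment.

```python
# Minimal sketch (not the authors' code) of a two-stage temporal group fusion.
# All module names, the group size, and the channel widths are assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F


class IntraGroupFusion(nn.Module):
    """Fuse the frames inside one group into a single feature map (assumed design)."""

    def __init__(self, in_ch=4, feat_ch=64):
        super().__init__()
        # A 3D convolution mixes information along the temporal axis of the group.
        self.conv3d = nn.Conv3d(in_ch, feat_ch, kernel_size=3, padding=1)

    def forward(self, group):             # group: (B, T_g, C, H, W)
        x = group.permute(0, 2, 1, 3, 4)  # -> (B, C, T_g, H, W) for Conv3d
        x = F.relu(self.conv3d(x))
        return x.mean(dim=2)              # collapse the group's temporal axis -> (B, feat_ch, H, W)


class TemporalGroupAttention(nn.Module):
    """Adaptively weight per-group features before merging them (assumed design)."""

    def __init__(self, feat_ch=64):
        super().__init__()
        self.score = nn.Conv2d(feat_ch, 1, kernel_size=1)

    def forward(self, group_feats):        # group_feats: (B, G, feat_ch, H, W)
        b, g, c, h, w = group_feats.shape
        scores = self.score(group_feats.reshape(b * g, c, h, w)).reshape(b, g, 1, h, w)
        weights = torch.softmax(scores, dim=1)      # attention over the group axis
        return (weights * group_feats).sum(dim=1)   # fused feature: (B, feat_ch, H, W)


def two_stage_fusion(frames, masks, group_size=3):
    """frames: (B, T, 3, H, W); masks: (B, T, 1, H, W) with 1 marking missing pixels."""
    b, t, _, h, w = frames.shape
    x = torch.cat([frames * (1 - masks), masks], dim=2)   # masked frames + mask channel
    intra = IntraGroupFusion(in_ch=4, feat_ch=64)
    attn = TemporalGroupAttention(feat_ch=64)
    group_feats = [
        intra(x[:, s:s + group_size]) for s in range(0, t - group_size + 1, group_size)
    ]
    return attn(torch.stack(group_feats, dim=1))           # (B, 64, H, W) fused target feature


if __name__ == "__main__":
    frames = torch.rand(1, 6, 3, 64, 64)
    masks = (torch.rand(1, 6, 1, 64, 64) > 0.9).float()
    print(two_stage_fusion(frames, masks).shape)  # torch.Size([1, 64, 64, 64])
```

In this sketch the fused feature would still need a decoder to produce the inpainted frame; the point is only that the groups are merged by learned attention weights rather than by warping frames onto the target.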
