Free-Form Image Inpainting via Contrastive Attention Network

Xiaoqiang Zhou, Xin Ma, Ran He, Huaibo Huang, Zhenhua Chai, Xiaolin Wei

doi:10.48448/ycsd-8381

Abstract

Most deep learning-based image inpainting approaches adopt an autoencoder or one of its variants to fill missing regions in images. Encoders are usually utilized to learn powerful representational spaces, which are important for dealing with sophisticated learning tasks. In image inpainting specifically, masks of any shape can appear anywhere in an image (i.e., free-form masks), forming complex patterns. It is difficult for encoders to learn powerful representations in this complex setting. To tackle this problem, we propose a self-supervised Siamese inference network to improve robustness and generalization. It can encode contextual semantics from full-resolution images and obtain more discriminative representations. We further propose a multi-scale decoder with a novel dual attention fusion (DAF) module, which can combine the restored and known regions smoothly. This multi-scale architecture is beneficial for decoding the discriminative representations learned by encoders into images layer by layer. In this way, unknown regions are filled naturally from outside to inside.
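The idea of combining restored and known regions smoothly can be illustrated with a plain soft-mask blend. The sketch below is not the DAF module, whose internals the abstract does not describe. It is only a minimal, hypothetical example that assumes a soft hole mask with values in [0, 1] (1.0 for missing pixels, 0.0 for known pixels). The function name `blend_known_and_restored` and the toy values are assumptions made for illustration.

```python
def blend_known_and_restored(image, restored, hole_mask):
    """Blend a restored image with the known pixels of the input.

    All arguments are 2D lists of floats with identical shapes.
    hole_mask is assumed to lie in [0, 1]: 1.0 marks a missing pixel,
    0.0 marks a known pixel, and fractional values near the hole
    boundary give a gradual transition instead of a hard seam.
    """
    if not (len(image) == len(restored) == len(hole_mask)):
        raise ValueError("inputs must have the same number of rows")

    blended = []
    for img_row, res_row, mask_row in zip(image, restored, hole_mask):
        if not (len(img_row) == len(res_row) == len(mask_row)):
            raise ValueError("inputs must have the same number of columns")
        # Per pixel: take the restored value in proportion to the mask,
        # and the original (known) value in proportion to its complement.
        blended.append([
            m * r + (1.0 - m) * x
            for x, r, m in zip(img_row, res_row, mask_row)
        ])
    return blended


# Toy one-row example: known pixel, boundary pixel, missing pixel.
image = [[0.2, 0.4, 0.6]]
restored = [[0.9, 0.8, 0.7]]
hole_mask = [[0.0, 0.5, 1.0]]
result = blend_known_and_restored(image, restored, hole_mask)
```

In this toy case the first pixel keeps its known value, the last pixel takes the restored value, and the boundary pixel is an even mix of the two. A learned fusion module would presumably predict such weights from features rather than take them from a fixed mask.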
