Abstract

Recently, deep learning has enabled a huge leap forward in image inpainting. However, due to the memory and computational limitation, most existing methods are able to handle only low-resolution inputs, typically less than 1 K. With the improvement of Internet transmission capacity and mobile device cameras, the resolution of image and video sources available to users via the cloud or locally is increasing. For high-resolution images, the common inpainting methods simply upsample the inpainted result of the shrinked image to yield a blurry result. In recent years, there is an urgent need to reconstruct the missing high-frequency information in high-resolution images and generate sharp texture details. Hence, we propose a general deep learning framework for high-resolution image inpainting, which first hallucinates a semantically continuous blurred result using low-resolution inpainting and suppresses computational overhead. Then the sharp high-frequency details with original resolution are reconstructed using super-resolution refinement. Experimentally, our method achieves inspiring inpainting quality on 2K and 4K resolution images, ahead of the state-of-the-art high-resolution inpainting technique. This framework is expected to be popularized for high-resolution image editing tasks on personal computers and mobile devices in the future.

Highlights

  • Image inpainting or image completion, which involves the automatic recovery of missing pixels of an image according to the known information within the image, is an important research area in computer vision

  • Results are compared against the state-of-the-art HR image inpainting technology (HiFill by Yi et al [17], CVPR 2020) both qualitatively and quantitatively

  • The LR inpainted map is fed into the SR network for frame inference to reconstruct the high frequency details at high resolution

Read more

Summary

Introduction

Image inpainting or image completion, which involves the automatic recovery of missing pixels of an image according to the known information within the image, is an important research area in computer vision. A stream of these methods hallucinates missing pixels using learned data distribution [9,10,11,18] Another stream fills the hole using a data-driven manner with the external image sources [12,13,14,15,16]. Though these methods can yield meaningful structure in missing regions, the generated regions are often blurred and accompanied by artifacts. There is an urgent need for methods that can reconstruct the missing high-frequency information in HR images and generate sharp texture details

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call