Abstract

In this paper we address the problem of hallucinating high-resolution facial images from low-resolution inputs at high magnification factors. We approach this task with convolutional neural networks (CNNs) and propose a novel (deep) face hallucination model that incorporates identity priors into the learning procedure. The model consists of two main parts: i) a cascaded super-resolution network that upscales the low-resolution facial images, and ii) an ensemble of face recognition models that act as identity priors for the super-resolution network during training. Different from most competing super-resolution techniques that rely on a single model for upscaling (even with large magnification factors), our network uses a cascade of multiple SR models that progressively upscale the low-resolution images using steps of 2× . This characteristic allows us to apply supervision signals (target appearances) at different resolutions and incorporate identity constraints at multiple-scales. The proposed C-SRIP model (Cascaded Super Resolution with Identity Priors) is able to upscale (tiny) low-resolution images captured in unconstrained conditions and produce visually convincing results for diverse low-resolution inputs. We rigorously evaluate the proposed model on the Labeled Faces in the Wild (LFW), Helen and CelebA datasets and report superior performance compared to the existing state-of-the-art.

Highlights

  • F ACE hallucination (FH) represents a domain-specific super-resolution (SR) problem where the goal is to recover high-resolution (HR) facial images from lowresolution (LR) inputs [1]

  • As we show through extensive experiments on the Labeled Faces in the Wild (LFW), Helen and CelebA datasets, the combination of reconstruction-oriented and identity-related losses results in visually convincing super-resolved face images that compare favourably with state-of-the-art FH models from literature

  • One performance decrease we see is when switching from the MSE loss (Baseline) to the structural similarity [35] (SSIM)-based loss (B-SSIM), which slightly lowers the average Peak Signal-to-Noise Ratio (PSNR) score on LFW and HELEN, but results in higher SSIM and VIF scores on all three datasets

Read more

Summary

Introduction

F ACE hallucination (FH) represents a domain-specific super-resolution (SR) problem where the goal is to recover high-resolution (HR) facial images from lowresolution (LR) inputs [1]. Face hallucination techniques have important applications in various face-related vision tasks, such as face editing, face detection, 3D face reconstruction or face recognition [2]–[10], where they are used to counteract performance degradations caused by low-resolution input images. To general single-image super-resolution tasks, face hallucination is inherently ill-posed. Given a fixed imagedegradation model, every LR facial image can be shown to Manuscript received February 12, 2019; revised June 26, 2019 and August 23, 2019; accepted September 18, 2019. The associate editor coordinating the review of this manuscript and approving it for publication was Dr Chia-Kai Liang. The associate editor coordinating the review of this manuscript and approving it for publication was Dr Chia-Kai Liang. (Corresponding author: Klemen Grm.)

Objectives
Methods
Results
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call