Abstract
To address the problem of automatically detecting and removing the mask without user interaction, we present a GAN-based automatic approach for face de-occlusion, called Automatic Mask Generation Network for Face De-occlusion Using Stacked Generative Adversarial Networks (AFD-StackGAN). In this approach, we decompose the problem into two primary stages (i.e., Stage-I Network and Stage-II Network) and employ a separate GAN in each stage. The Stage-I Network (Binary Mask Generation Network) automatically creates a binary mask for the masked region in the input (occluded) images. The Stage-II Network (Face De-occlusion Network) then removes the mask object and synthesizes the damaged region with fine details while preserving the restored face's appearance and structural consistency. Furthermore, we create a paired synthetic face-occluded dataset from the publicly available CelebA face images to train the proposed model. AFD-StackGAN is evaluated on real-world test images gathered from the Internet. Our extensive experimental results confirm the robustness and efficiency of the proposed model in removing complex mask objects from facial images compared to previous image manipulation approaches. Additionally, we provide ablation studies comparing performance with user-defined versus automatically defined masks and demonstrate the benefits of the refiner networks in the generation process.
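To make the two-stage pipeline described in the abstract concrete, the sketch below shows one plausible way the inference flow could be wired together. The module and class names (MaskGenerator/FaceDeoccluder-style components wrapped in TwoStageDeocclusion) are assumptions for illustration only; the paper does not publish this code, and the actual architectures and channel arrangements may differ.

```python
# Minimal sketch of a two-stage de-occlusion forward pass, assuming a Stage-I
# mask-prediction network and a Stage-II inpainting network are supplied.
import torch
import torch.nn as nn


class TwoStageDeocclusion(nn.Module):
    def __init__(self, mask_net: nn.Module, deocclusion_net: nn.Module):
        super().__init__()
        self.mask_net = mask_net                # Stage-I: predicts a binary occlusion mask
        self.deocclusion_net = deocclusion_net  # Stage-II: synthesizes the hidden face region

    def forward(self, occluded: torch.Tensor) -> torch.Tensor:
        # Stage-I: estimate where the mask object is (probabilities in [0, 1]).
        mask_logits = self.mask_net(occluded)
        mask = (torch.sigmoid(mask_logits) > 0.5).float()  # binarize the predicted mask
        # Stage-II: condition the de-occlusion network on the occluded image
        # and the predicted mask, and return the restored face.
        restored = self.deocclusion_net(torch.cat([occluded, mask], dim=1))
        return restored
```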
Highlights
Face occlusion, which has become increasingly common worldwide in recent years, makes core computer vision tasks such as face recognition, identification, tracking, detection, classification, face parsing, and contour extraction significantly more challenging to tackle
This work automatically eliminates challenging mask objects from the face and synthesizes the damaged area with fine details while preserving the restored face's appearance and structural consistency.
This work attempts to alleviate the manual mask selection burden by providing a straightforward method that can intelligently and automatically generate a binary mask of the occluded region in facial images.
One potential application of an automatic mask generation network is video, where mask objects continuously conceal the face's structural semantics.
We experimentally show that the proposed model with an automatically generated mask is more effective at removing mask objects and generating realistic face semantics than models with manually generated masks.
Figure (qualitative results): the first row contains input images; the second row, the corresponding binary masks generated by the mask generation network; the third row, the masks refined by the mask refiner network; and the last two rows, the output of the Stage-II Network.
Summary
Object detection is the process of finding various objects in an image. Face occlusion detection aims to detect the facial region occluded by other objects. Several variants of FCN, such as [8,9,10], have been proposed to make it more suitable for image segmentation tasks. All these approaches use a modified version of a classification network (removing its fully connected layers and replacing them with standard CNN layers) as an encoder to produce a low-resolution image representation. The SeGAN segmentor network takes an image and its visible area as input and generates the mask of the whole object that has been occluded. Multi-Task GAN (MT-GAN) [14] used an SRN (super-resolution network) to up-scale small, distorted images into large, clear images for better detection. Instead of using these expensive algorithms to automatically detect non-face objects in facial images, we employ a simple encoder-decoder network architecture that focuses on mask objects. The encoder-decoder architecture has three convolution layers in the encoder and three transposed convolution layers in the decoder.
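The sketch below illustrates an encoder-decoder of the shape described above: three strided convolutions followed by three transposed convolutions that produce a single-channel mask. The summary only specifies the 3 + 3 layer structure, so the channel widths, kernel sizes, strides, and activations here are assumptions, not the paper's exact configuration.

```python
# Minimal sketch of a three-layer encoder / three-layer decoder for binary
# mask prediction (hyperparameters are illustrative assumptions).
import torch
import torch.nn as nn


class MaskEncoderDecoder(nn.Module):
    def __init__(self, in_channels: int = 3):
        super().__init__()
        # Encoder: three strided convolutions that downsample the input image.
        self.encoder = nn.Sequential(
            nn.Conv2d(in_channels, 64, kernel_size=4, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(64, 128, kernel_size=4, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(128, 256, kernel_size=4, stride=2, padding=1), nn.ReLU(inplace=True),
        )
        # Decoder: three transposed convolutions that upsample back to the input
        # resolution and output a single-channel mask probability map.
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(256, 128, kernel_size=4, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.ConvTranspose2d(128, 64, kernel_size=4, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.ConvTranspose2d(64, 1, kernel_size=4, stride=2, padding=1), nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.decoder(self.encoder(x))


# Example: a 128x128 RGB image yields a 128x128 single-channel mask map.
mask = MaskEncoderDecoder()(torch.randn(1, 3, 128, 128))  # shape (1, 1, 128, 128)
```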