Abstract

In CycleGAN, an image-to-image translation architecture was established without the use of paired datasets by combining adversarial and cycle consistency losses. The success of CycleGAN was followed by numerous studies that proposed new translation models. For example, StarGAN performs multi-domain translation with a single generator–discriminator pair, while U-GAT-IT aims to close the large face-to-anime translation gap with its own adaptive normalization function. However, constructing robust, conditional translation models involves trade-offs once the computational cost of training on graphics processing units (GPUs) is considered: if designers implement conditional models with complex convolutional neural network (CNN) layers and normalization functions, the GPUs must reserve large amounts of memory when training begins. This study aims to resolve this trade-off through Multi-CartoonGAN, an improved CartoonGAN architecture that outputs conditional translated images and adapts to translations with large feature gaps between the source and target domains. To accomplish this, Multi-CartoonGAN reduces the computational cost by using a pretrained VGGNet to calculate the consistency loss instead of reusing the generator. Additionally, we report on the development of conditional adaptive layer-instance normalization (CAdaLIN), used in our model to make it robust to unique feature translations. We performed extensive experiments using Multi-CartoonGAN to translate real-world face images into three artistic styles: portrait, anime, and caricature. An analysis of the visualized translated images and a comparison of GPU usage show that our model performs translations with unique style features that follow the conditional inputs, at a reduced GPU computational cost during training.
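The abstract does not reproduce CAdaLIN's exact formulation, but its intent can be illustrated. Below is a minimal PyTorch sketch assuming CAdaLIN extends U-GAT-IT's adaptive layer-instance normalization (AdaLIN) by deriving the affine parameters from a one-hot domain code; the class name, the linear conditioning layers, and the initial value of `rho` are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn

class CAdaLIN(nn.Module):
    """Sketch of conditional adaptive layer-instance normalization.

    Assumption: CAdaLIN extends AdaLIN (U-GAT-IT) by producing the affine
    parameters (gamma, beta) from a one-hot domain code, so a single
    generator can modulate its features per target style. The exact
    formulation in the paper may differ.
    """

    def __init__(self, num_features: int, num_domains: int, eps: float = 1e-5):
        super().__init__()
        self.eps = eps
        # rho blends instance and layer statistics, as in AdaLIN (assumed init).
        self.rho = nn.Parameter(torch.full((1, num_features, 1, 1), 0.9))
        # Hypothetical conditioning: per-domain affine parameters.
        self.gamma = nn.Linear(num_domains, num_features)
        self.beta = nn.Linear(num_domains, num_features)

    def forward(self, x: torch.Tensor, domain: torch.Tensor) -> torch.Tensor:
        # Instance statistics: per sample, per channel (over H, W).
        in_mean = x.mean(dim=(2, 3), keepdim=True)
        in_var = x.var(dim=(2, 3), keepdim=True, unbiased=False)
        x_in = (x - in_mean) / torch.sqrt(in_var + self.eps)
        # Layer statistics: per sample (over C, H, W).
        ln_mean = x.mean(dim=(1, 2, 3), keepdim=True)
        ln_var = x.var(dim=(1, 2, 3), keepdim=True, unbiased=False)
        x_ln = (x - ln_mean) / torch.sqrt(ln_var + self.eps)
        # Learnable blend of the two normalizations, rho kept in [0, 1].
        rho = self.rho.clamp(0.0, 1.0)
        x_hat = rho * x_in + (1.0 - rho) * x_ln
        # Domain-conditional affine transform; domain is one-hot, shape [B, D].
        gamma = self.gamma(domain).unsqueeze(-1).unsqueeze(-1)
        beta = self.beta(domain).unsqueeze(-1).unsqueeze(-1)
        return gamma * x_hat + beta
```

Under this reading, switching among the portrait, anime, and caricature styles at inference time only requires changing the one-hot domain code, e.g. `CAdaLIN(256, 3)(x, torch.eye(3)[[0]])` for the first style.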

Highlights

  • Studies exploring deep learning modeling have expanded to the field of image processing

  • In the field of image recognition, Shutanov et al. [1] explored the possibility of using convolutional neural networks (CNNs) to recognize traffic signs

  • Cycle-consistency-based models incur high computational costs on graphics processing units (GPUs) because of the need to repeatedly use the generator to obtain the cycle consistency loss. This means that, when a generator consists of multiple CNN layers and complex normalization layers, additional computational resources are required each time the generator is reused. In response to these issues, this study aims to construct an N-domain translation model that handles extreme appearance translations while saving computational costs during training (see the sketch after this list)
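To make the cost argument concrete, here is a minimal PyTorch sketch of a VGG-based consistency loss in the spirit of CartoonGAN: the translated image and the source photo are compared in the feature space of a frozen, pretrained VGG19, so no second generator pass (as cycle consistency would require) is needed. The specific cut-off layer (conv4_4) and the use of an L1 distance are assumptions for illustration.

```python
import torch
import torch.nn as nn
from torchvision import models

class VGGConsistencyLoss(nn.Module):
    """Sketch of a VGG feature (content-consistency) loss.

    Assumption: consistency is measured as an L1 distance between
    pretrained-VGG features of the source photo and the translated image,
    so the generator never runs a second (cycle) pass. Inputs are assumed
    to be normalized with ImageNet statistics.
    """

    def __init__(self):
        super().__init__()
        vgg = models.vgg19(weights=models.VGG19_Weights.IMAGENET1K_V1).features
        # Keep layers up to (roughly) conv4_4 of VGG19; freeze all weights.
        self.features = nn.Sequential(*list(vgg.children())[:26]).eval()
        for p in self.features.parameters():
            p.requires_grad = False
        self.l1 = nn.L1Loss()

    def forward(self, real: torch.Tensor, translated: torch.Tensor) -> torch.Tensor:
        # One forward pass per image through a frozen network. By contrast,
        # cycle consistency, L1(G_BA(G_AB(x)), x), runs the generator twice
        # and must keep its activations in GPU memory for backpropagation.
        return self.l1(self.features(translated), self.features(real))
```

Because the VGG network is frozen, its activations need no gradient buffers, which is where the memory saving over generator reuse comes from.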


Summary

Introduction

Studies exploring deep learning modeling have expanded to the field of image processing. In the field of image recognition, Shutanov et al. [1] explored the possibility of using convolutional neural networks (CNNs) to recognize traffic signs. Most image recognition and enhancement tasks require the preparation of paired input and target data, such as class labels for recognition or clean images for improving noisy inputs. Preparing target images is often a cumbersome task, depending on the image processing method. This is especially true for image-to-image translation tasks, such as translating real-world photos into segmented images under supervised learning, because paired images must be searched for and generated.

