Abstract

Neural style transfer is the technique of rendering the semantic content of one image in different artistic styles using a convolutional neural network (CNN); it has attracted great attention in both academic research and industrial engineering and has demonstrated remarkable results. Recently, the Gram matrices used in the original and follow-up studies for style transfer were shown theoretically to be equivalent to minimizing a specific Maximum Mean Discrepancy (MMD). Since Gram matrices are therefore not a must for style transfer, how to design a proper process for aligning the neural activations between images is an important problem. After careful analysis of several algorithms for constructing the style loss, we discovered that some algorithms consider the relationships between different feature maps of a CNN layer (inter-class relationships), while others do not (intra-class relationships). Surprisingly, the latter often show more details and finer strokes in the results. To further support this observation, we propose two new style transfer methods, one that takes inter-class relationships into account and one that does not, and conduct comparative experiments with existing methods. The experimental results verify our observation: our proposed methods achieve comparable perceptual quality at lower complexity. We believe this interpretation provides an effective basis for designing style loss functions for style transfer methods with different visual effects.
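The Gram–MMD equivalence mentioned above can be made concrete. A sketch of the standard formulation (notation is ours, not taken from this paper): let $F \in \mathbb{R}^{N \times M}$ hold the $N$ feature maps of a layer, each flattened to $M$ spatial positions, and let $G = FF^{\top}$ be the Gram matrix.

```latex
% Per-layer Gram style loss between generated features F and style features F':
\mathcal{L}_{\text{style}}
  = \frac{1}{4N^{2}M^{2}} \sum_{i,j} \bigl(G_{ij} - G'_{ij}\bigr)^{2},
\qquad G = FF^{\top},\; G' = F'F'^{\top}.

% Expanding the square shows this equals a (scaled) squared MMD between the
% columns f_k of F and f'_l of F' under the quadratic polynomial kernel
% k(x, y) = (x^{\top} y)^{2}:
\mathcal{L}_{\text{style}}
  = \frac{1}{4N^{2}}\,\mathrm{MMD}^{2}\!\left[\{f_k\}, \{f'_l\}\right],
\quad k(x, y) = (x^{\top} y)^{2}.
```

In this view, matching Gram matrices is just one choice of kernel for matching the distributions of neural activations, which is why the abstract argues that other alignment processes are equally legitimate design options.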

Highlights

  • Transferring the look and feel of one image to another image is an interesting but difficult problem

  • Since we wanted to conduct the comparative experiments with the Gram matrix algorithm, we set the value of ratio α/β to be the same as in the Gram matrix algorithm

  • The Neural Style Transfer algorithm proposed by Gatys et al. produces fantastic stylized images with the appearance of a given famous artwork



Introduction

Transferring the look and feel (style) of one image to another is an interesting but difficult problem. Gatys et al. proposed a pioneering work that captures the artistic style of an image and transfers it to other images using a convolutional neural network (CNN) [4]. It formulates style transfer as creating an image that matches both the content statistics and the style statistics of a given pair of images, which provide the content and style information separately, based on the neural activations at each layer of the CNN. The work has attracted a lot of attention and demonstrated interesting and remarkable visual results in both research laboratories and industry; however, its key mechanism, in particular why the Gram matrix can represent artistic style, remains a mystery. By stacking many such layers on top of one another, CNNs can develop abstract, high-level representations of the image content in the deeper layers of the network.
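To make the Gram-matrix statistics concrete, here is a minimal NumPy sketch of the per-layer style loss used in Gatys et al.'s formulation. The shapes, normalization constants, and function names are our own illustrative choices, not taken from this paper; a real implementation would operate on activations extracted from a pretrained CNN such as VGG.

```python
import numpy as np

def gram_matrix(features):
    """Channel-wise Gram matrix of one layer's activations.

    features: array of shape (C, H, W), i.e. C feature maps.
    Returns a (C, C) matrix whose (i, j) entry is the inner product of
    feature maps i and j, capturing their pairwise correlations --
    the "inter-class relationships" discussed in the text.
    """
    c, h, w = features.shape
    f = features.reshape(c, h * w)   # flatten each map to a row vector
    return f @ f.T / (h * w)         # normalize by the spatial size

def gram_style_loss(feat_a, feat_b):
    """Squared Frobenius distance between the two Gram matrices."""
    c = feat_a.shape[0]
    ga, gb = gram_matrix(feat_a), gram_matrix(feat_b)
    return float(np.sum((ga - gb) ** 2) / (4 * c ** 2))

# Toy usage: random tensors stand in for CNN activations.
rng = np.random.default_rng(0)
a = rng.standard_normal((8, 16, 16))
b = rng.standard_normal((8, 16, 16))
print(gram_style_loss(a, a))       # identical statistics -> loss is 0
print(gram_style_loss(a, b) > 0)   # differing statistics -> positive loss
```

An "intra-class" variant, by contrast, would compare each feature map only with matching statistics of itself (e.g. its mean and variance) rather than with the other maps, which is the distinction the paper's comparative experiments explore.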

