Abstract

Computational inference of aesthetics has recently become a hot topic due to its usefulness in widely applications such as evaluating image quality, retouching image and retrieving image. Owing to the subjectivity of this problem, there is no general framework to predict image aesthetics. In this paper, we propose a deep neural framework with visual attention module, self-generated global features and hybrid loss to address this problem. Specifically, the framework can be any state-of-the-art convolution classification network compatible with visual attention. Further, self-generated global feature compensates for the loss of global context information during training stage, and the hybrid loss guides the network to learn the similarity between the predicted aesthetic scores and the ground-truths through fusing soft-max-entropy and Earth Mover’s Distance(EMD). With the above-mentioned improvements, the proposed deep neural framework is capable of effectively predicting image aesthetics in an efficient way. In our experiments, we release a real-world aesthetic dataset that contains 1,800 2K photos labeled by several experienced photographers, and then provide a thorough ablation study of the design choices to better understand the superiority brought by each part of our framework, and design several comparisons with the state-of-the-art methods on a fraction of metrics. The experimental results on two datasets demonstrate that both accuracy and efficiency achieve favorably performance.

Highlights

  • I MAGE aesthetic quality assessment (IAQA) is a longstanding visual task, which lays foundation in many multimedia applications such as image retouching, image ranking and image retrieval

  • Considering that image aesthetic assessment is affected by global features, we propose and incorporate global features into our proposed network

  • The experiments include two parts: One part evaluates our methods D169(C&G&H) and IRV(C&G&H) against several state-of-the-art methods on datasets Alltuu and AVA. It first respectively presents the overall comparison in multi-class aesthetic quality classification and binary aesthetic quality classification

Read more

Summary

Introduction

I MAGE aesthetic quality assessment (IAQA) is a longstanding visual task, which lays foundation in many multimedia applications such as image retouching, image ranking and image retrieval. Photographic retouchers use photograph editing software to enhance images based on human aesthetic quality. It is essential to design an outperforming model to assess image aesthetic quality quickly. The goal of IAQA is to design the algorithms which automatically predict image aesthetic quality. It is challenging as the aesthetic score of given images relies on several undetermined factors, such as composition, color distribution, technical quality and so on. Earlier approaches aim to classify aesthetic attributes of an image using hand-crafted features and achieve good progress. Hand-crafted features depend heavily on expert knowledge, and cannot capture feature presentations comprehensively

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call