Abstract

Image aesthetic estimation using deep learning has recently achieved great success compared with traditional methods based on hand-crafted features. Like a recognition problem, aesthetic estimation categorizes images as visually appealing or not. Nevertheless, it is desirable to understand why certain images are visually more appealing, and specifically which part of an image contributes to the aesthetic preference. In fact, most traditional approaches adopting hand-crafted features are, to some extent, able to capture part of an image's aesthetic and content information, while few such studies have been conducted in the context of deep learning. Moreover, we discover that aesthetic ratings are ambiguous, so that many examples are uncertain in aesthetic level, which causes a highly imbalanced distribution of aesthetic ratings. To tackle these issues, we propose an end-to-end convolutional neural network (CNN) model that simultaneously performs aesthetic classification and understanding. To overcome the imbalanced aesthetic ratings, we propose a sample-specific classification method that re-weights the importance of samples. We find that dropping ambiguous images, as commonly adopted by recent deep learning models, is a special case of the sample-specific method, and we also observe that performance improves as the weights of the non-ambiguous images increase. To understand what is learned in the deep model, global average pooling (GAP) following the last feature map is employed to generate an aesthetic activation map (AesAM) and an attribute activation map (AttAM). AesAM and AttAM respectively represent the likelihood of the aesthetic level and the likelihood of different attributes at each spatial location. In particular, AesAM mainly accounts for what is learned in the deep model. Experiments are carried out on public aesthetic datasets and state-of-the-art performance is achieved.
Thanks to the introduction of AttAM, the aesthetic preference is explainable by visualization. Finally, a simple application of the AesAM to image cropping is presented. The code and trained model will be publicly available at https://github.com/galoiszhang/AWCU.
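The sample-specific re-weighting idea can be illustrated with a minimal NumPy sketch. This is not the paper's exact formulation: the weighting function `sample_weights`, the threshold, the band width `delta`, and the cap `w_max` are all illustrative assumptions. It does show the structural point made in the abstract: weighting samples by how far their rating lies from the decision boundary, with zero weight inside an ambiguous band, recovers the common practice of dropping ambiguous images as a special case, while larger weights for non-ambiguous images increase their influence on the loss.

```python
import numpy as np

def sample_weights(mean_scores, threshold=5.0, delta=0.5, w_max=2.0):
    """Hypothetical sample-specific weights (illustrative, not the paper's
    exact scheme): images whose mean aesthetic score lies far from the
    decision threshold are less ambiguous and receive a larger weight.
    Samples exactly at the threshold get weight 0, which reproduces the
    'drop ambiguous images' strategy as a special case."""
    distance = np.abs(np.asarray(mean_scores, dtype=float) - threshold)
    return np.minimum(distance / delta, w_max)

def weighted_binary_cross_entropy(probs, labels, weights):
    """Binary cross-entropy where each sample's contribution is re-weighted."""
    probs = np.clip(np.asarray(probs, dtype=float), 1e-7, 1 - 1e-7)
    labels = np.asarray(labels, dtype=float)
    losses = -(labels * np.log(probs) + (1 - labels) * np.log(1 - probs))
    return float(np.sum(weights * losses) / (np.sum(weights) + 1e-12))
```

For example, images rated 5.0, 5.2, and 8.0 against a threshold of 5.0 would receive weights 0.0, 0.4, and 2.0 (capped), so the clearly high-quality image dominates the loss while the borderline images contribute little or nothing.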
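The GAP-based activation maps (AesAM/AttAM) follow the class activation mapping (CAM) recipe: because GAP is followed by a single fully connected layer, the class score is a weighted sum of the channel-wise averages, so projecting the same fully connected weights back onto the last convolutional feature maps yields a per-location likelihood map. A minimal sketch of that projection, with assumed tensor shapes (`feature_maps` as channels-first `(C, H, W)`, `fc_weights` as `(num_classes, C)`):

```python
import numpy as np

def activation_map(feature_maps, fc_weights, class_idx):
    """CAM-style activation map: weighted sum of the last conv feature maps
    using the fully connected weights of the chosen class (the layer that
    follows global average pooling).

    feature_maps: (C, H, W) activations of the last conv layer.
    fc_weights:   (num_classes, C) weights of the FC layer after GAP.
    Returns an (H, W) map normalized to [0, 1]."""
    cam = np.tensordot(fc_weights[class_idx], feature_maps, axes=([0], [0]))
    cam -= cam.min()          # shift so the minimum response is 0
    if cam.max() > 0:
        cam /= cam.max()      # scale to [0, 1] for visualization
    return cam
```

Used with the aesthetic classifier's weights this produces an AesAM; used with an attribute head's weights it produces an AttAM for that attribute. The low-resolution map is typically upsampled to the input size and overlaid on the image for visualization or, as in the cropping application, thresholded to locate the most aesthetically salient region.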
