Abstract

There are many tasks on image aesthetic assessment, such as aesthetic classification, scoring, score distribution prediction, and captions. Due to the distribution of the aesthetic score is unbalanced, the assessment models always output scores near the mean score. In this paper, we propose a fine-grained regression method for aesthetics score regression and combine position and channel attention mechanisms to enhance the aesthetic feature fusion. And by training the regression network separately from the classification network, we make the classification task a complement to the regression task. Besides, the researchers are used to using Mean Square Error (MSE) as the main evaluation metric which is inadequate in measuring the error of each interval. In order to fully consider the images of the various aesthetic score segments, instead of focusing on the intermediate aesthetic score segments because of the imbalance of the aesthetic datasets, we propose a new evaluation metric called Segmented Mean Square Errors (SMSE) to prove the advantages of the model. We divide the entire AADB dataset into 10 equal parts based on the aesthetic scores and the experiments were carried out on each of the segmented AADB datasets. In this way, images for each aesthetic score segment are fairly considered. The experimental results reveal that our method outperforms all the state-of-the-art methods on both MSE and SMSE. The dual attention modules of position and channel also make the activation maps more reasonable. Our methods make the aesthetic scoring go beyond laboratories to real life applications. Because computational visual aesthetics is a very interesting and challenging task in the field of computer vision, and computer vision is also one of the key areas of focus of this journal, the method proposed in this paper is closely related to the field covered by the journal.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.