Abstract

The goal of full-reference image quality assessment (FR-IQA) is to predict the perceptual quality of an image as perceived by human observers using its pristine (distortion free) reference counterpart. In this study, we explore a novel, combined approach which predicts the perceptual quality of a distorted image by compiling a feature vector from convolutional activation maps. More specifically, a reference-distorted image pair is run through a pretrained convolutional neural network and the activation maps are compared with a traditional image similarity metric. Subsequently, the resulting feature vector is mapped onto perceptual quality scores with the help of a trained support vector regressor. A detailed parameter study is also presented in which the design choices of the proposed method is explained. Furthermore, we study the relationship between the amount of training images and the prediction performance. Specifically, it is demonstrated that the proposed method can be trained with a small amount of data to reach high prediction performance. Our best proposal—called ActMapFeat—is compared to the state-of-the-art on six publicly available benchmark IQA databases, such as KADID-10k, TID2013, TID2008, MDID, CSIQ, and VCL-FER. Specifically, our method is able to significantly outperform the state-of-the-art on these benchmark databases.

Highlights

  • In recent decades, a continuous growth in the number of digital images has been observed, due to the spread of smart phones and various social media

  • full-reference image quality assessment (FR-image quality assessment (IQA)) metrics are tested on the whole database, and we report on the Pearson’s linear correlation coefficient (PLCC), s rank-order correlation coefficient (SROCC), and Kendall’s rank-order correlation coefficient (KROCC)

  • We introduced a framework for FR-IQA relying on feature vectors, which were obtained by comparing reference and distorted activation maps by traditional image similarity metrics

Read more

Summary

Introduction

A continuous growth in the number of digital images has been observed, due to the spread of smart phones and various social media. As a result of the huge number of imaging sensors, there is a massive amount of visual data being produced each day. Digital images may suffer different distortions during the procedure of acquisition, transmission, or compression. Unsatisfactory perceived visual quality or a certain level of annoyance may occur. Image quality assessment (IQA) algorithms can be classified into three different classes based on the availability of the reference, undistorted image. Full-reference (FR) and reduced-reference (RR) IQA algorithms have full and partial information about the reference image, respectively. No-reference (NR) IQA methods do not posses any information about the reference image

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call