Abstract

Recently, the visual quality evaluation of screen content images (SCIs) has become an important and timely emerging research theme. This article presents an effective and novel blind quality evaluation metric for SCIs by using stacked autoencoders (SAE) based on pictorial and textual regions. Since the SCI consists of not only the pictorial area but also the textual area, the human visual system (HVS) is not equally sensitive to their different distortion types. First, the textual and pictorial regions can be obtained by dividing an input SCI via an SCI segmentation metric. Next, we extract quality-aware features from the textual region and pictorial region, respectively. Then, two different SAEs are trained via an unsupervised approach for quality-aware features that are extracted from these two regions. After the training procedure of the SAEs, the quality-aware features can evolve into more discriminative and meaningful features. Subsequently, the evolved features and their corresponding subjective scores are input into two regressors for training. Each regressor can obtain one output predictive score. Finally, the final perceptual quality score of a test SCI is computed by these two predicted scores via a weighted model. Experimental results on two public SCI-oriented databases have revealed that the proposed scheme can compare favorably with the existing blind image quality assessment metrics.

Highlights

  • W ITH the flying start and rapid popularization of online gaming, remote desktop control, mobile web browsing, virtual screen sharing and other application scenarios, the processing and transmission of SCIs have become more and more extensive [1]

  • Several contrast experiments are conducted to prove the superiority of the presented metric and we analyze the impacts of the weighted model and the deep architecture in the designed model

  • The SIQAD database [2] is the first database to be established which is comprised of 20 original SCIs and 980 distorted SCIs corrupted by seven types of distorted versions

Read more

Summary

Introduction

W ITH the flying start and rapid popularization of online gaming, remote desktop control, mobile web browsing, virtual screen sharing and other application scenarios, the processing and transmission of SCIs have become more and more extensive [1]. The SCI is a composite image that contains computer-generated graphics and text, and natural. The new score is set as a nonlinear combination of quality scores via multiple metrics and appropriate weights acquired through the training process [4]–[6]. Different quality evaluation approaches can deal with different types of image distortions well. The proposed model applying multiple regressors is inspired by this concept. We introduce an image quality evaluator for SCIs via stacked auto-encoder based on different regions. In order to model the framework, the textual and pictorial regions are obtained and two parallel SAEs are trained in an unsupervised manner, respectively. The metric we put forward has the following contributions:

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call