NDNetGaming - development of a no-reference deep CNN for gaming video quality prediction

Markus Utke,Sebastian Möller,Saman Zadtootaghaj,Sebastian Bosse,Steven Schmidt

doi:10.1007/s11042-020-09144-6

Abstract

Gaming video streaming services are growing rapidly due to new services such as passive video streaming of gaming content, e.g. Twitch.tv, as well as cloud gaming, e.g. Nvidia GeForce NOW and Google Stadia. In contrast to traditional video content, gaming content has special characteristics such as extremely high and special motion patterns, synthetic content and repetitive content, which poses new opportunities for the design of machine learning-based models to outperform the state-of-the-art video and image quality approaches for this special computer generated content. In this paper, we train a Convolutional Neural Network (CNN) based on an objective quality model, VMAF, as ground truth and fine-tuned it based on subjective image quality ratings. In addition, we propose a new temporal pooling method to predict gaming video quality based on frame-level predictions. Finally, the paper also describes how an appropriate CNN architecture can be chosen and how well the model performs on different contents. Our result shows that among four popular network architectures that we investigated, DenseNet performs best for image quality assessment based on the training dataset. By training the last 57 convolutional layers of DenseNet based on VMAF values, we obtained a high performance model to predict VMAF of distorted frames of video games with a Spearman’s Rank correlation (SRCC) of 0.945 and Root Mean Score Error (RMSE) of 7.07 on the image level, while achieving a higher performance on the video level leading to a SRCC of 0.967 and RMSE of 5.47 for the KUGVD dataset. Furthermore, we fine-tuned the model based on subjective quality ratings of images from gaming content which resulted in a SRCC of 0.93 and RMSE of 0.46 using one-hold-out cross validation. Finally, on the video level, using the proposed pooling method, the model achieves a very good performance indicated by a SRCC of 0.968 and RMSE of 0.30 for the used gaming video dataset.

Highlights

The gaming industry has been one of the largest digital markets for decades and is rapidly growing due to emerging online services such as gaming video streaming, online gaming and cloud gaming (CG) services
We describe the development process of a Convolutional Neural Network (CNN) based NR video quality metric that predicts the quality for video of gaming content impaired by compression artefacts
We evaluated a total of twelve image/video quality assessment (VQA) metrics on the dataset as follows: Peak Signal to Noise Ratio (PSNR) is the most widely used VQA metric and relies on the computation of the logarithmic difference between corresponding pixels in the original and impaired frame

Summary

Introduction

The gaming industry has been one of the largest digital markets for decades and is rapidly growing due to emerging online services such as gaming video streaming, online gaming and cloud gaming (CG) services. By using VMAF as a proxy of perceptual quality, a larger database can be generated compared to psychophysical tests This allows to train networks with a higher number of parameters, i.e. deeper networks as well as allowing the network learn different types of image distortions such as blockiness and bluriness. This is because we can increase the size of the dataset by recording and encoding more gaming video sequences and calculating VMAF scores for the encoded frames without conducting large scale subjective tests. The selection of VMAF, among many other quality metrics, is based on its high performance shown in previous studies in [5, 7]

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Multimedia Tools and Applications	Publication Date: Jul 24, 2020
Citations: 23	License type: open-access

R Discovery Prime

R Discovery Prime

NDNetGaming - development of a no-reference deep CNN for gaming video quality prediction

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Multimedia Tools and Applications

Lead the way for us

Similar Papers

Study of Subjective and Objective Quality Assessment of Mobile Cloud Gaming Videos.
Avinab Saha ... Xiaoming Wang
IEEE transactions on image processing : a publication of the IEEE Signal Processing Society | VOL. 32
Avinab Saha, et. al.Avinab Saha ... Xiaoming Wang
01 Jan 2023
IEEE transactions on image processing : a publication of the IEEE Signal Processing Society | VOL. 32

A New Algorithm for Automatically Calculating Noise, Spatial Resolution, and Contrast Image Quality Metrics: Proof-of-Concept and Agreement With Subjective Scores in Phantom and Clinical Abdominal CT.
Cécile R.L.P.N Jeukens ... Emmeline Laupman
Investigative Radiology | VOL. 58
Cécile R.L.P.N Jeukens, et. al.Cécile R.L.P.N Jeukens ... Emmeline Laupman
23 Jan 2023
Investigative Radiology | VOL. 58

Study on the Relationship between Different Body Mass Indexes and Puncture Pain and Image Quality in Patients Undergoing Coronary Angiography with Intravenous Indwelling Needle.
Jiani Ding ... Xiaoyu Chen
Emergency Medicine International | VOL. 2022
Jiani Ding, et. al.Jiani Ding ... Xiaoyu Chen
19 May 2022
Emergency Medicine International | VOL. 2022

<title>Radiologists' confidence in detecting abnormalities on chest images and their subjective judgments of image quality</title>
Jill L King ... F Leland Thaete
-
Jill L King, et. al.Jill L King ... F Leland Thaete
01 Jul 1991
01 Jul 1991

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

NDNetGaming - development of a no-reference deep CNN for gaming video quality prediction

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Multimedia Tools and Applications