Abstract

Panoramic video and stereoscopic panoramic video are essential carriers of virtual reality content, so establishing quality assessment models for them is crucial to the standardization of the virtual reality industry. However, evaluating the quality of panoramic video remains challenging. One reason is that the spatial information of panoramic video is warped by the projection process, which conventional video quality assessment (VQA) methods struggle to handle. Another is that traditional VQA methods have difficulty capturing the complex global temporal information in panoramic video. To address these problems, this paper presents an end-to-end neural network model for evaluating the quality of panoramic video and stereoscopic panoramic video. Unlike other panoramic video quality assessment methods, the proposed method combines spherical convolutional neural networks (CNNs) and non-local neural networks, which together effectively extract the complex spatiotemporal information of panoramic video. We evaluate the method on two databases, VRQ-TJU and VR-VQA48. Experiments show the effectiveness of the different modules in our method, and our method outperforms state-of-the-art related methods.

Highlights

  • As a new means of simulation and interaction, virtual reality (VR) has attracted increasing attention in recent years [1]

  • As a representative method of this idea, weighted-to-spherically-uniform PSNR (WS-PSNR) [21] weights per-pixel error by the spherical area each pixel covers after projection (a sketch of the computation follows this list). To resolve the contradiction between convolutional neural networks (CNN) and global time domain information, non-local neural networks are integrated into our proposed framework

  • We propose a deep learning based method that evaluates the quality of panoramic video and stereoscopic panoramic video end-to-end
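The formula dropped from the second highlight during extraction is, following [21], a latitude-weighted mean squared error: each pixel's error counts in proportion to the spherical area it covers after equirectangular projection. Below is a minimal NumPy sketch under that reading; the function name and the single-channel, single-frame simplification are ours, not the paper's code.

    import numpy as np

    def ws_psnr(ref, dist, max_val=255.0):
        """WS-PSNR for two (H, W) equirectangular frames of identical shape."""
        h, w = ref.shape
        # Row weight w(j) = cos((j + 0.5 - H/2) * pi / H): ~1 at the equator,
        # ~0 at the poles, matching the ERP stretching ratio.
        weights = np.cos((np.arange(h) + 0.5 - h / 2) * np.pi / h)
        weights = np.broadcast_to(weights[:, None], (h, w))
        err = (ref.astype(np.float64) - dist.astype(np.float64)) ** 2
        wmse = np.sum(weights * err) / np.sum(weights)
        return 10.0 * np.log10(max_val ** 2 / wmse)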


Summary

INTRODUCTION

As a new means of simulation and interaction, virtual reality (VR) has attracted increasing attention in recent years [1]. Yu et al. [22] projected the pixels of the original and distorted panoramic video planes onto a sphere and uniformly sampled a large number of points on the spherical surface to calculate the PSNR. They proposed two indicators, S-PSNR and L-PSNR, which differ in whether higher weight is given to the equator. The non-local neural network module [25] endows the feature maps in the network with attention information, so that the global time domain information of the panoramic video can be extracted together with the spherical CNN; a hedged sketch of such a block follows this paragraph. We elaborate on the characteristics of panoramic video and related work (Section II), describe our method (Section III), evaluate it through extensive experiments (Section IV), and draw conclusions and discuss future directions (Section V).
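To make the non-local idea concrete, here is a hedged PyTorch sketch of a non-local block in the embedded-Gaussian form of [25], applied to a spatiotemporal feature map of shape (B, C, T, H, W). The channel reduction to C/2 and the residual placement follow common practice for such blocks; where exactly the paper inserts it into its spherical CNN is not reproduced here.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class NonLocalBlock(nn.Module):
        def __init__(self, channels):
            super().__init__()
            self.inter = max(channels // 2, 1)
            self.theta = nn.Conv3d(channels, self.inter, kernel_size=1)
            self.phi = nn.Conv3d(channels, self.inter, kernel_size=1)
            self.g = nn.Conv3d(channels, self.inter, kernel_size=1)
            self.out = nn.Conv3d(self.inter, channels, kernel_size=1)

        def forward(self, x):
            b, c, t, h, w = x.shape
            n = t * h * w
            theta = self.theta(x).view(b, self.inter, n).transpose(1, 2)  # (B, N, C')
            phi = self.phi(x).view(b, self.inter, n)                      # (B, C', N)
            g = self.g(x).view(b, self.inter, n).transpose(1, 2)          # (B, N, C')
            # Attention over ALL space-time positions at once: this is what
            # lets the block capture the global temporal dependencies that
            # local convolutions miss.
            attn = F.softmax(theta @ phi, dim=-1)                         # (B, N, N)
            y = (attn @ g).transpose(1, 2).reshape(b, self.inter, t, h, w)
            return x + self.out(y)  # residual: the block can be inserted anywhere

    feat = torch.randn(1, 64, 8, 16, 32)   # toy (B, C, T, H, W) feature map
    print(NonLocalBlock(64)(feat).shape)   # torch.Size([1, 64, 8, 16, 32])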

Spatial Domain Characteristics of Panoramic Video
Global Time Domain Information Extraction of Panoramic Video
General Idea of VRVQA
PROPOSED METHOD
Preprocessing
Spherical CNN
Non-local Neural Networks
Network Design and Training
Datasets
Experimental Setups
Performance Evaluation
Module Comparison Evaluation
Distortion Type Evaluation
Objective Score
CONCLUSION
