End-to-End Blind Quality Assessment of Compressed Videos Using Deep Neural Networks

Wentao Liu,Zhou Wang,Zhengfang Duanmu

doi:10.1145/3240508.3240643

Abstract

Blind video quality assessment (BVQA) algorithms are traditionally designed with a two-stage approach - a feature extraction stage that computes typically hand-crafted spatial and/or temporal features, and a regression stage working in the feature space that predicts the perceptual quality of the video. Unlike the traditional BVQA methods, we propose a Video Multi-task End-to-end Optimized neural Network (V-MEON) that merges the two stages into one, where the feature extractor and the regressor are jointly optimized. Our model uses a multi-task DNN framework that not only estimates the perceptual quality of the test video but also provides a probabilistic prediction of its codec type. This framework allows us to train the network with two complementary sets of labels, both of which can be obtained at low cost. The training process is composed of two steps. In the first step, early convolutional layers are pre-trained to extract spatiotemporal quality-related features with the codec classification subtask. In the second step, initialized with the pre-trained feature extractor, the whole network is jointly optimized with the two subtasks together. An additional critical step is the adoption of 3D convolutional layers, which creates novel spatiotemporal features that lead to a significant performance boost. Experimental results show that the proposed model clearly outperforms state-of-the-art BVQA methods.The source code of V-MEON is available at https://ece.uwaterloo.ca/~zduanmu/acmmm2018bvqa.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

End-to-End Blind Quality Assessment of Compressed Videos Using Deep Neural Networks

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Blind video quality assessment based on Spatio-Temporal Feature Resolver
Xiaodong Bi ... Raymond Edward Sheriff
Neurocomputing | VOL. 574
Xiaodong Bi, et. al.Xiaodong Bi ... Raymond Edward Sheriff
11 Jan 2024
Neurocomputing | VOL. 574

HDR-BVQM: High dynamic range blind video quality model
Naima Aamir ... Imran Fareed Nizami
Multimedia Tools and Applications | VOL. 80
Naima Aamir, et. al.Naima Aamir ... Imran Fareed Nizami
23 May 2021
Multimedia Tools and Applications | VOL. 80

Blind video quality assessment based on multilevel video perception
Tongfeng Sun ... Wei Chen
Signal Processing: Image Communication | VOL. 99
Tongfeng Sun, et. al.Tongfeng Sun ... Wei Chen
21 Sep 2021
Signal Processing: Image Communication | VOL. 99

A New Blind Video Quality Metric for Assessing Different Turbulence Mitigation Algorithms
Chiman Kwan ... Bence Budavari
Electronics | VOL. 10
Chiman Kwan, et. al.Chiman Kwan ... Bence Budavari
16 Sep 2021
Electronics | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

End-to-End Blind Quality Assessment of Compressed Videos Using Deep Neural Networks

Abstract

Talk to us

Similar Papers