MRET: Multi-resolution transformer for video quality assessment

Junjie Ke,Feng Yang,Tianhao Zhang,Peyman Milanfar,Yilin Wang

doi:10.3389/frsip.2023.1137006

Junjie Ke, Feng Yang + Show 3 more

Open Access

https://doi.org/10.3389/frsip.2023.1137006

Copy DOI

Journal: Frontiers in Signal Processing	Publication Date: Mar 29, 2023
Citations: 4	License type: CC BY 4.0

Affiliation: Google (United States)

Abstract

No-reference video quality assessment (NR-VQA) for user generated content (UGC) is crucial for understanding and improving visual experience. Unlike video recognition tasks, VQA tasks are sensitive to changes in input resolution. Since large amounts of UGC videos nowadays are 720p or above, the fixed and relatively small input used in conventional NR-VQA methods results in missing high-frequency details for many videos. In this paper, we propose a novel Transformer-based NR-VQA framework that preserves the high-resolution quality information. With the multi-resolution input representation and a novel multi-resolution patch sampling mechanism, our method enables a comprehensive view of both the global video composition and local high-resolution details. The proposed approach can effectively aggregate quality information across different granularities in spatial and temporal dimensions, making the model robust to input resolution variations. Our method achieves state-of-the-art performance on large-scale UGC VQA datasets LSVQ and LSVQ-1080p, and on KoNViD-1k and LIVE-VQC without fine-tuning.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

MRET: Multi-resolution transformer for video quality assessment

Abstract

Talk to us

Similar Papers

More From: Frontiers in Signal Processing

Lead the way for us

Similar Papers

Bitrate-Based No-Reference Video Quality Assessment Combining the Visual Perception of Video Contents
Juncai Yao ... Guizhong Liu
IEEE Transactions on Broadcasting | VOL. 65
Juncai Yao, et. al.Juncai Yao ... Guizhong Liu
01 Sep 2019
IEEE Transactions on Broadcasting | VOL. 65

No-reference video quality assessment metric using spatiotemporal features through LSTM
Ngai-Wing Kwong ... Tsz-Kwan Lee
-
Ngai-Wing Kwong, et. al.Ngai-Wing Kwong ... Tsz-Kwan Lee
13 Mar 2021
13 Mar 2021

No-Reference Video Quality Assessment With 3D Shearlet Transform and Convolutional Neural Networks
Yuming Li ... Litong Feng
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 26
Yuming Li, et. al.Yuming Li ... Litong Feng
01 Jun 2016
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 26

No-Reference Video Quality Assessment by Multilayer Selected Deep Features and Neural Networks
Zheng-Lung Chu ... Kuan-Hsien Liu
-
Zheng-Lung Chu, et. al.Zheng-Lung Chu ... Kuan-Hsien Liu
28 Sep 2020
28 Sep 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

MRET: Multi-resolution transformer for video quality assessment

Abstract

Talk to us

Similar Papers

More From: Frontiers in Signal Processing