Quality Of Compressed Video Research Articles

Deep learning methods have achieved impressive performance in compressed video quality enhancement tasks. However, these methods rely heavily on practical experience in manually designing the network structure and do not fully exploit the feature information contained in video sequences. In particular, they neither take full advantage of the multiscale similarity of compression artifacts nor adequately consider the impact of partition boundaries in the compressed video on overall video quality. In this article, we propose a novel Mixed Difference Equation inspired Transformer (MDEformer) for compressed video quality enhancement, which provides a relatively reliable principle to guide network design and yields new insight into interpretable transformers. Specifically, drawing on the graphical concept of the mixed difference equation (MDE), we utilize multiple cross-layer cross-attention aggregation (CCA) modules to establish long-range dependencies between the encoders and decoders of the transformer, where partition boundary smoothing (PBS) modules are inserted as feedforward networks. The CCA module makes full use of the multiscale similarity of compression artifacts to remove them effectively and to recover the texture and detail information of the frame. The PBS module leverages the sensitivity of smoothing convolution to partition boundaries to reduce the impact of those boundaries on the quality of the compressed video, without substantially affecting non-boundary pixels. Extensive experiments on the MFQE 2.0 dataset demonstrate that the proposed MDEformer removes compression artifacts to improve the quality of compressed video, and surpasses state-of-the-art (SOTA) methods in terms of both objective metrics and visual quality.
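To make the idea of smoothing only at partition boundaries concrete, here is a minimal sketch on a single row of pixels. It is not the PBS module from the article, which is a learned component inside a transformer. It assumes a fixed block size of 8 and a hand-picked [1, 2, 1] / 4 filter, and the function name `smooth_block_boundaries` is hypothetical. The point is only to show that filtering the two pixels on either side of each boundary softens the blocking step while every other pixel stays unchanged.

```python
def smooth_block_boundaries(row, block_size=8):
    """Low-pass filter only the pixels adjacent to block boundaries.

    Assumptions (not from the article): fixed square partitions of
    `block_size` pixels and a fixed [1, 2, 1] / 4 smoothing kernel.
    """
    out = list(row)          # copy, so non-boundary pixels pass through as-is
    n = len(row)
    for b in range(block_size, n, block_size):
        # b is the first pixel of a new block; b - 1 is the last of the previous one
        for i in (b - 1, b):
            left = row[i - 1] if i > 0 else row[i]
            right = row[i + 1] if i + 1 < n else row[i]
            # filter reads from the original row, so the two updates are independent
            out[i] = (left + 2 * row[i] + right) / 4.0
    return out


# Two flat 8-pixel blocks with a hard step between them (illustrative values)
row = [10] * 8 + [50] * 8
smoothed = smooth_block_boundaries(row)
print(smoothed[6:10])  # the step of 40 at the boundary shrinks to 20
```

A learned module can adapt the amount of smoothing to the content; this fixed filter cannot, which is why it is only an illustration of the boundary-selective behavior described above.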

We study the visual quality judgments of human subjects on digital human avatars (sometimes referred to as "holograms" in the parlance of virtual reality [VR] and augmented reality [AR] systems) that have been subjected to distortions. We also study the ability of video quality models to predict human judgments. As streaming of human avatar videos in VR and AR becomes increasingly common, more advanced human avatar video compression protocols will be required to manage the trade-off between faithfully transmitting high-quality visual representations and adapting to changing bandwidth conditions. During transmission over the internet, the perceived quality of compressed human avatar videos can be severely impaired by visual artifacts. To optimize trade-offs between perceptual quality and data volume in practical workflows, video quality assessment (VQA) models are essential tools. However, very few VQA algorithms have been developed specifically to analyze human body avatar videos, due, at least in part, to the dearth of appropriate and comprehensive datasets of adequate size. Towards filling this gap, we introduce the LIVE-Meta Rendered Human Avatar VQA Database, which contains 720 human avatar videos processed using 20 different combinations of encoding parameters, labeled with corresponding human perceptual quality judgments that were collected in six-degrees-of-freedom VR headsets. To demonstrate the usefulness of this new and unique video resource, we use it to study and compare the performance of a variety of state-of-the-art Full Reference and No Reference video quality prediction models, including a new model called HoloQA. As a service to the research community, we publicly release the metadata of the new database at https://live.ece.utexas.edu/research/LIVE-Meta-rendered-human-avatar/index.html.
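Comparing VQA models against human judgments usually comes down to a correlation between model scores and subjective scores. The abstract does not say which correlation measures were used, so the sketch below assumes the Spearman rank-order correlation coefficient (SROCC), a common choice in VQA studies. The function names and the scores in the usage example are invented for illustration and are not taken from the database.

```python
import math


def average_ranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # extend j over the run of values tied with values[order[i]]
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences with non-zero variance."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(vx * vy)


def srocc(predicted, subjective):
    """Spearman correlation: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(predicted), average_ranks(subjective))


# Made-up scores for five videos: model predictions vs. mean opinion scores
predicted = [0.31, 0.55, 0.48, 0.80, 0.92]
subjective = [1.9, 3.2, 2.7, 4.1, 4.6]
print(srocc(predicted, subjective))
```

SROCC depends only on the ordering of the scores, so a model is not penalized for using a different numeric scale than the human ratings. In practice, `scipy.stats.spearmanr` computes the same quantity.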

Articles published on Quality Of Compressed Video

Spatio-temporal feature learning for enhancing video quality based on screen content characteristics

Reconstruction flow recurrent network for compressed video quality enhancement

Subjective and objective quality evaluation of UGC video after encoding and decoding

PixRevive: Latent Feature Diffusion Model for Compressed Video Quality Enhancement

End-to-End Transformer for Compressed Video Quality Enhancement

MDEformer: Mixed Difference Equation Inspired Transformer for Compressed Video Quality Enhancement

Subjective and Objective Quality Assessment of Rendered Human Avatar Videos in Virtual Reality

Perceptual Video Coding for Machines via Satisfied Machine Ratio Modeling

Multi-Swin Transformer based Spatio-Temporal Information Exploration for Compressed Video Quality Enhancement

Compressed Video Quality Enhancement With Temporal Group Alignment and Fusion

A Comparative Measurement Study of Point Cloud-Based Volumetric Video Codecs

Video Compression Artifact Reduction by Fusing Motion Compensation and Global Context in a Swin-CNN Based Parallel Architecture

Improved Recurrence Plots Compression Distance by Learning Parameter for Video Compression Quality

Compressed video quality enhancement algorithm based on 3D-CNNs

Deep Quality Assessment of Compressed Videos: A Subjective and Objective Study

Improving Compressed Video Using Single Lightweight Model with Temporal Fusion Module

FastCNN: Towards Fast and Accurate Spatiotemporal Network for HEVC Compressed Video Enhancement

Fast CU Decision Algorithm Based on Texture Complexity and CNN for VVC

Multi-Frame Compressed Video Quality Enhancement by Spatio-Temporal Information Balance

Improving video quality by predicting inter-frame residuals based on an additive 3D-CNN model
