CLSA: A Contrastive Learning Framework with Selective Aggregation for Video Rescaling.

Yuan Tian,Yichao Yan,Li Chen,Guangtao Zhai,Zhiyong Gao

doi:10.1109/tip.2023.3242774

Abstract

Video rescaling has recently drawn extensive attention for its practical applications such as video compression. Compared to video super-resolution, which focuses on upscaling bicubic-downscaled videos, video rescaling methods jointly optimize a downscaler and a upscaler. However, the inevitable loss of information during downscaling makes the upscaling procedure still ill-posed. Furthermore, the network architecture of previous methods mostly relies on convolution to aggregate information within local regions, which cannot effectively capture the relationship between distant locations. To address the above two issues, we propose a unified video rescaling framework by introducing the following designs. First, we propose to regularize the information of the downscaled videos via a contrastive learning framework, where, particularly, hard negative samples for learning are synthesized online. With this auxiliary contrastive learning objective, the downscaler tends to retain more information that benefits the upscaler. Second, we present a selective global aggregation module (SGAM) to efficiently capture long-range redundancy in high-resolution videos, where only a few representative locations are adaptively selected to participate in the computationally-heavy self-attention (SA) operations. SGAM enjoys the efficiency of the sparse modeling scheme while preserving the global modeling capability of SA. We refer to the proposed framework as Contrastive Learning framework with Selective Aggregation (CLSA) for video rescaling. Comprehensive experimental results show that CLSA outperforms video rescaling and rescaling-based video compression methods on five datasets, achieving state-of-the-art performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

CLSA: A Contrastive Learning Framework with Selective Aggregation for Video Rescaling.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Image Processing

Lead the way for us

Journal: IEEE Transactions on Image Processing	Publication Date: Jan 1, 2023
Citations: 9

Similar Papers

Neural Networks for Image and Video Compression
Artem Gorodilov ... Dmitriy Gavrilov
-
Artem Gorodilov, et. al.Artem Gorodilov ... Dmitriy Gavrilov
01 Oct 2018
01 Oct 2018

Compressed Domain Deep Video Super-Resolution.
Peilin Chen ... Shiqi Wang
IEEE Transactions on Image Processing | VOL. 30
Peilin Chen, et. al.Peilin Chen ... Shiqi Wang
01 Jan 2020
IEEE Transactions on Image Processing | VOL. 30

Unveiling the SALSTM-M5T model and its python implementation for precise solar radiation prediction
Mohammad Ehteram ... Hanieh Shabanian
Energy Reports | VOL. 10
Mohammad Ehteram, et. al.Mohammad Ehteram ... Hanieh Shabanian
11 Oct 2023
Energy Reports | VOL. 10

Subject Review: Video Compression Algorithms
Amal Abbas Kadhim ... Zuhair Hussein Ali
International Journal of Engineering Research and Advanced Technology | VOL. 06
Amal Abbas Kadhim, et. al.Amal Abbas Kadhim ... Zuhair Hussein Ali
01 Jan 2020
International Journal of Engineering Research and Advanced Technology | VOL. 06

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

CLSA: A Contrastive Learning Framework with Selective Aggregation for Video Rescaling.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Image Processing