Optimized video compression with residual split attention and swin-block artifact contraction

Afsana Ahsan Jeny,Md Baharul Islam

doi:10.1016/j.jvcir.2022.103737

Abstract

Research in video compression has seen significant advancement in the last several years. However, the existing deep learning-based algorithms continue to be plagued by erroneous motion compression and ineffective motion compensation architectures, resulting in compression errors with a lower rate–distortion trade-off. To overcome these challenges, we present an end-to-end purely deep learning-based video compression method through a set of primary operations (e.g., motion estimation, motion compression, motion compensation, residual compression, and artifact contraction) differently. A deep residual attention split (DRAS) block is introduced for motion compression networks to pay more attention to certain image regions to create more effective features for the decoder while boosting the rate–distortion optimization (RDO) efficiency. A channel residual block (CRB) is proposed in motion compensation to yield a more accurate predicted frame, potentially improving the residual frame. To mitigate the compression errors, an artifact contraction module (ACM) by residual swin convolution UNet block is included in this model to improve the reconstruction quality. To improve the final frame, a buffer is added to fine-tune the previous reference frames. These modules combine with a loss function by assessing the trade-off and enhancing the decoded video quality. A comprehensive ablation study demonstrates the effectiveness of the proposed blocks and modules for video compression. Experimental results show the competitive performance of the proposed method on four benchmark datasets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Optimized video compression with residual split attention and swin-block artifact contraction

Abstract

Talk to us

Similar Papers

More From: Journal of Visual Communication and Image Representation

Lead the way for us

Journal: Journal of Visual Communication and Image Representation	Publication Date: Dec 23, 2022
Citations: 2

Similar Papers

Coarse-To-Fine Deep Video Coding with Hyperprior-Guided Mode Prediction
Zhihao Hu ... Wei Jiang
-
Zhihao Hu, et. al.Zhihao Hu ... Wei Jiang
01 Jun 2022
01 Jun 2022

FVC: A New Framework towards Deep Video Compression in Feature Space
Zhihao Hu ... Dong Xu
-
Zhihao Hu, et. al.Zhihao Hu ... Dong Xu
01 Jun 2021
01 Jun 2021

Enhanced Motion Compensation for Deep Video Compression
Haifeng Guo ... Sam Kwong
IEEE Signal Processing Letters | VOL. 30
Haifeng Guo, et. al.Haifeng Guo ... Sam Kwong
01 Jan 2023
IEEE Signal Processing Letters | VOL. 30

Video Compression USING a New Active Mesh Based Motion Compensation Algorithm in Wavelet Sub-Bands
Mohammad Hossein Bisjerdi ... Alireza Behrad
Journal of Signal and Information Processing | VOL. 03
Mohammad Hossein Bisjerdi, et. al.Mohammad Hossein Bisjerdi ... Alireza Behrad
01 Jan 2012
Journal of Signal and Information Processing | VOL. 03

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Optimized video compression with residual split attention and swin-block artifact contraction

Abstract

Talk to us

Similar Papers

More From: Journal of Visual Communication and Image Representation