Learned image compression with generalized octave convolution and cross-resolution parameter estimation

Haisheng Fu,Feng Liang

doi:10.1016/j.sigpro.2022.108778

Abstract

Recently, image compression approaches based on deep learning have gradually outperformed existing image compression standards including BPG and VVC intra coding. In particular, the application of the context-adaptive entropy model significantly improves the rate-distortion (R-D) performance, in which hyperpriors and autoregressive models are jointly utilized to effectively capture the spatial redundancy of the latent representations. However, the latent representations still contain some spatial correlations. In addition, these methods based on the context-adaptive entropy model cannot be accelerated in the decoding process by parallel computing devices, e.g. FPGA or GPU. To alleviate these limitations, we propose a learned multi-resolution image compression framework, which exploits the recently developed octave convolutions to factorize the latent representations into the high-resolution (HR) and low-resolution (LR) parts, similar to wavelet transform, which further improves the R-D performance. To speed up the decoding, our scheme does not use context-adaptive entropy model. Instead, we exploit an additional hyper layer including hyper encoder and hyper decoder to further remove the spatial redundancy of the latent representation. Moreover, the cross-resolution parameter estimation (CRPE) is introduced into the proposed framework to enhance the flow of information and further improve the rate-distortion performance. An additional information-fidelity loss is proposed to the total loss function to adjust the contribution of the LR part to the final bit stream. Experimental results show that our method separately reduces the decoding time by approximately 73.35 and 93.44 % compared with that of state-of-the-art learned image compression methods, and the R-D performance is still better than H.266/VVC(4:2:0) and some learning-based methods on both PSNR and MS-SSIM metrics across a wide bit rates.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Learned image compression with generalized octave convolution and cross-resolution parameter estimation

Abstract

Talk to us

Similar Papers

More From: Signal Processing

Lead the way for us

Journal: Signal Processing	Publication Date: Sep 12, 2022
Citations: 9

Similar Papers

Learned Progressive Image Compression With Dead-Zone Quantizers
Shaohui Li ... Hongkai Xiong
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 33
Shaohui Li, et. al.Shaohui Li ... Hongkai Xiong
01 Jun 2023
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 33

Checkerboard Context Model for Efficient Learned Image Compression
Dailan He ... Yaoyan Zheng
-
Dailan He, et. al.Dailan He ... Yaoyan Zheng
01 Jun 2021
01 Jun 2021

Learning Image and Video Compression Through Spatial-Temporal Energy Compaction
Zhengxue Cheng ... Masaru Takeuchi
-
Zhengxue Cheng, et. al.Zhengxue Cheng ... Masaru Takeuchi
01 Jun 2019
01 Jun 2019

Learned Block-Based Hybrid Image Compression
Yaojun Wu ... Xin Li
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 32
Yaojun Wu, et. al.Yaojun Wu ... Xin Li
01 Jun 2022
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 32

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learned image compression with generalized octave convolution and cross-resolution parameter estimation

Abstract

Talk to us

Similar Papers

More From: Signal Processing