Learning-Based Scalable Image Compression With Latent-Feature Reuse and Prediction

Yixin Mei, Fan Li, Li Li, Zhu Li

doi:10.1109/tmm.2021.3114548

Abstract

Recently, learning-based image compression models have attracted much attention due to their impressive performance and ease of optimization compared with traditional DCT- and wavelet-based image compression standards. Most learning-based image compression models are trained to minimize a joint rate-distortion (RD) loss at a single RD trade-off point. However, in many multimedia applications, communication constraints or display adaptation needs for different spatial formats, bit rates, or power make it necessary to provide a variety of image versions for different client devices. To fulfill this requirement, typical end-to-end image compression methods have to compress an image into several bitstreams independently using a number of pre-trained networks, which is resource-consuming because of the redundancy among these streams. To address this problem, inspired by the traditional scalable video coding framework, we propose a learning-based end-to-end quality and spatial scalable image compression (QSSIC) model with a multi-layer structure, in which each layer generates one bitstream corresponding to a specified resolution and image fidelity. This scalability is achieved by exploring the potential of feature-domain representation prediction and reuse. Specifically, first, bitstreams of previous layers are used to predict the current layer's representations, which contain the enhancement information, so that only prediction residuals need to be coded in enhancement layers. Second, previous bitstreams are reused for image reconstruction in higher layers to provide basic information. The proposed model can be optimized in an end-to-end manner. Extensive experiments show that our method outperforms state-of-the-art deep neural network (DNN)-based auto-encoders in simulcast scenarios. In addition, our method performs better than the traditional scalable image compression method, the scalable extension of H.264/AVC (SVC), and is comparable to the scalable extension of H.265/HEVC (SHVC).
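
The prediction-and-residual idea described in the abstract can be illustrated with a toy two-layer sketch. This is not the QSSIC network: the learned transforms are replaced by fixed stand-ins (average pooling, nearest-neighbour upsampling, uniform quantization), and every function name and step size below is a hypothetical choice made only for illustration. It shows a base layer being decoded on its own, and an enhancement layer that codes only the residual left after predicting from the base layer.

```python
import numpy as np


def quantize(x, step):
    """Uniform scalar quantization (stand-in for a learned, entropy-coded latent)."""
    return np.round(x / step) * step


def downsample2x(x):
    """2x2 average pooling (stand-in for a learned analysis transform)."""
    h, w = x.shape
    return x.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))


def upsample2x(x):
    """Nearest-neighbour 2x upsampling (stand-in for a learned prediction network)."""
    return np.repeat(np.repeat(x, 2, axis=0), 2, axis=1)


def encode_two_layers(latent, base_step=1.0, enh_step=0.25):
    """Return (base, residual): a coarse base layer and an enhancement residual."""
    # Base layer: low-resolution, coarsely quantized representation.
    base = quantize(downsample2x(latent), base_step)
    # Predict the enhancement-layer representation from the decoded base layer.
    prediction = upsample2x(base)
    # Only the prediction residual is coded in the enhancement layer.
    residual = quantize(latent - prediction, enh_step)
    return base, residual


def decode_base(base):
    """Base-layer reconstruction: needs only the base bitstream."""
    return upsample2x(base)


def decode_enhanced(base, residual):
    """Enhancement reconstruction: reuses the base layer and adds the residual."""
    return upsample2x(base) + residual
```

A decoder that receives only `base` obtains the coarse reconstruction, while one that also receives `residual` refines it without the base information being sent a second time. In this toy setting the enhanced reconstruction error is bounded by half the enhancement quantization step per element.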

Journal: IEEE Transactions on Multimedia
Publication Date: Jan 1, 2022
Citations: 12
License type: publisher-specific-oa

Similar Papers

High-Fidelity Variable-Rate Image Compression via Invertible Activation Transformation
Shilv Cai ... Luxin Yan
10 Oct 2022

Design of Optimized Neuro-Wavelet Based Hybrid Model for Image Compression
Deepak Gambhir ... Navin Rajpal
01 Jan 2010

LSCIC Pre Coder for Image and Video Compression
Muhammad Kamran ... Wang YiZhuo
01 Mar 2010

Design and Optimization of Graph Transform for Image and Video Compression
01 Jan 2017
