Abstract
We review a class of methods that can be collected under the name nonlinear transform coding (NTC), which over the past few years have become competitive with the best linear transform codecs for images, and have superseded them in terms of rate-distortion performance under established perceptual quality metrics such as MS-SSIM. We assess the empirical rate-distortion performance of NTC with the help of simple example sources, for which the optimal performance of a vector quantizer is easier to estimate than with natural data sources. To this end, we introduce a novel variant of entropy-constrained vector quantization. We provide an analysis of various forms of stochastic optimization techniques for NTC models; review architectures of transforms based on artificial neural networks, as well as learned entropy models; and provide a direct comparison of a number of methods to parameterize the rate-distortion trade-off of nonlinear transforms, introducing a simplified one.
Highlights
There is no end in sight for the world’s reliance on multimedia communication
This paper reviews some of the recent developments in data-driven lossy compression; in particular, we focus on a class of methods that can be collectively called nonlinear transform coding (NTC), providing insights into its capabilities and challenges
In linear transform coding with a Gaussian source assumption, the probabilistic model P in eq (9) is typically considered to be a distribution factorized over each latent dimension, since the Karhunen–Loève Transform (KLT) factorizes the source
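The decorrelating property of the KLT can be checked numerically. The sketch below (a toy 2-D illustration; the covariance matrix and sample size are hypothetical choices, not taken from the paper) transforms a correlated Gaussian source into its eigenbasis and shows that the latent covariance becomes diagonal, which is what justifies a factorized model P over the latent dimensions:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical correlated 2-D Gaussian source.
cov = np.array([[1.0, 0.8],
                [0.8, 1.0]])
x = rng.multivariate_normal(mean=[0.0, 0.0], cov=cov, size=100_000)

# The KLT is the eigenbasis of the source covariance; transforming into it
# decorrelates the source, so a Gaussian latent factorizes over dimensions.
eigvals, eigvecs = np.linalg.eigh(np.cov(x, rowvar=False))
y = x @ eigvecs  # latent representation

latent_cov = np.cov(y, rowvar=False)
print(np.round(latent_cov, 3))  # off-diagonal entries are (numerically) zero
```

Because the eigenvectors are computed from the sample covariance itself, the latent sample covariance is exactly diagonal up to floating-point error; for a Gaussian source, decorrelation implies independence, hence the factorized model.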
Summary
There is no end in sight for the world’s reliance on multimedia communication. Digital devices have been increasingly permeating our daily lives, and with them comes the need to store, send, and receive images and audio ever more efficiently. Transform coding (TC) has been the method of choice for compressing this type of data source. In combination with stochastic optimization methods, such as stochastic gradient descent (SGD), and massively parallel computational hardware, a nearly universal set of tools for function approximation has emerged, and these tools have been applied to data compression [4]–[9]. This paper reviews some of the recent developments in data-driven lossy compression; in particular, we focus on a class of methods that can be collectively called nonlinear transform coding (NTC), providing insights into its capabilities and challenges. The last two sections discuss connections to related work and conclude the paper, respectively.
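The transform-coding pipeline the paper builds on can be sketched in a few lines. Below is a minimal toy illustration, not the paper's method: the affine maps `g_a` and `g_s` stand in for the learned nonlinear analysis and synthesis transforms of NTC, the source is a hypothetical Laplacian, and scalar rounding plays the role of quantization in the latent space:

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy stand-ins for the learned transforms (hypothetical; in NTC these are
# neural networks trained with SGD).
def g_a(x, scale=4.0):
    return scale * x          # analysis transform (encoder)

def g_s(y, scale=4.0):
    return y / scale          # synthesis transform (decoder)

x = rng.laplace(size=10_000)  # example source
y_hat = np.round(g_a(x))      # quantized latent representation
x_hat = g_s(y_hat)            # reconstruction

distortion = np.mean((x - x_hat) ** 2)
print(distortion)
```

A larger `scale` quantizes the latent more finely, lowering distortion while increasing the number of distinct latent values that must be entropy-coded; trading these off is exactly the rate-distortion optimization that NTC performs end to end.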