Corner-to-Center long-range context model for efficient learned image compression

Yang Sui, Ding Ding, Xiang Pan, Xiaozhong Xu, Shan Liu, Bo Yuan, Zhenzhong Chen

doi:10.1016/j.jvcir.2023.103990

Abstract

In learned image compression, the context model plays a pivotal role in capturing the dependencies among latent representations. To reduce the decoding time incurred by the serial autoregressive context model, the parallel context model has been proposed as an alternative that needs only two passes during decoding, which makes efficient image compression practical in real-world scenarios. However, its incomplete causal context degrades performance. To tackle this issue, we conduct an in-depth analysis of the performance degradation observed in existing parallel context models, focusing on two aspects: the Quantity and the Quality of the information used for context prediction and decoding. Based on this analysis, we propose the Corner-to-Center transformer-based Context Model (C3M), designed to enhance context and latent predictions and improve rate–distortion performance. Specifically, we use a logarithmic-based prediction order to progressively predict more context features from corner to center. In addition, to enlarge the receptive field of the analysis and synthesis transforms, we use the Long-range Crossing Attention Module (LCAM) in the encoder and decoder to capture long-range semantic information by assigning different window shapes to different channels. Extensive experimental evaluations show that the proposed method is effective and outperforms state-of-the-art parallel methods. Finally, based on the subjective analysis, we suggest that improving the representation of detail in transformer-based image compression is a promising direction to explore.
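The two-pass decoding and the incomplete causal context mentioned in the abstract can be illustrated with a checkerboard split of the latent grid, a common form of parallel context model. The abstract does not say that C3M uses this exact pattern, so the checkerboard layout, the 4-neighbour context, and the function names `checkerboard_passes` and `available_context` below are assumptions made for illustration only, not the authors' implementation.

```python
def checkerboard_passes(height, width):
    """Assign each latent position to decoding pass 0 (anchor) or 1 (non-anchor).

    Assumption: a checkerboard split, as used by typical two-pass parallel
    context models; the abstract does not specify the pattern used by C3M.
    """
    return [[(r + c) % 2 for c in range(width)] for r in range(height)]


def available_context(passes, r, c):
    """Return the 4-neighbours of (r, c) that were decoded in an earlier pass."""
    h, w = len(passes), len(passes[0])
    neighbours = [(r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)]
    return [
        (y, x)
        for y, x in neighbours
        if 0 <= y < h and 0 <= x < w and passes[y][x] < passes[r][c]
    ]


passes = checkerboard_passes(4, 4)

# A serial autoregressive model decodes one position per step.
serial_steps = 4 * 4
# The parallel model decodes all positions of one pass simultaneously.
parallel_steps = len({p for row in passes for p in row})

# Anchors (pass 0) have no spatial context at all: the causal context is incomplete.
anchor_ctx = available_context(passes, 0, 0)
# Interior non-anchors (pass 1) can use all four anchor neighbours.
inner_ctx = available_context(passes, 1, 2)

print(serial_steps, parallel_steps, anchor_ctx, len(inner_ctx))
```

In this toy layout half of the positions are decoded with no spatial neighbours available, which is one way to picture the "Quantity" of context that the abstract identifies as a source of performance degradation.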

Journal: Journal of Visual Communication and Image Representation

Similar Papers

Learned Progressive Image Compression With Dead-Zone Quantizers
Shaohui Li ... Hongkai Xiong
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 33
01 Jun 2023

Learned Block-Based Hybrid Image Compression
Yaojun Wu ... Xin Li
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 32
01 Jun 2022

Efficient image compression and decompression algorithms for OCR systems
Boban Arizanovic ... Vladan Vuckovic
Facta universitatis - series: Electronics and Energetics | VOL. 31
01 Jan 2018

Learned Image Compression Using Cross-Component Attention Mechanism
Wenhong Duan ... Siwei Ma
IEEE Transactions on Image Processing | VOL. 32
01 Jan 2023
