Multi-Modality Deep Network for Extreme Learned Image Compression

Xuhao Jiang,Bo Yan,Tian Tan,Weimin Tan,Liquan Shen

doi:10.1609/aaai.v37i1.25184

Abstract

Image-based single-modality compression learning approaches have demonstrated exceptionally powerful encoding and decoding capabilities in the past few years , but suffer from blur and severe semantics loss at extremely low bitrates. To address this issue, we propose a multimodal machine learning method for text-guided image compression, in which the semantic information of text is used as prior information to guide image compression for better compression performance. We fully study the role of text description in different components of the codec, and demonstrate its effectiveness. In addition, we adopt the image-text attention module and image-request complement module to better fuse image and text features, and propose an improved multimodal semantic-consistent loss to produce semantically complete reconstructions. Extensive experiments, including a user study, prove that our method can obtain visually pleasing results at extremely low bitrates, and achieves a comparable or even better performance than state-of-the-art methods, even though these methods are at 2x to 4x bitrates of ours.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multi-Modality Deep Network for Extreme Learned Image Compression

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Jun 26, 2023
Citations: 4

Similar Papers

Shrinkage as Activation for Learned Image Compression
Ogun Kirmemis ... A Murat Tekalp
-
Ogun Kirmemis, et. al.Ogun Kirmemis ... A Murat Tekalp
01 Oct 2020
01 Oct 2020

Learned Progressive Image Compression With Dead-Zone Quantizers
Shaohui Li ... Hongkai Xiong
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 33
Shaohui Li, et. al.Shaohui Li ... Hongkai Xiong
01 Jun 2023
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 33

Quantization Table Selection Using Firefly with Teaching and Learning Based Optimization Algorithm for Image Compression
D Preethi ... D Loganathan
-
D Preethi, et. al.D Preethi ... D Loganathan
01 Jan 2019
01 Jan 2019

Modified Firefly Algorithm for Vector Quantization Codebook Design in Image Compression
Ms D Preethi ... Dr D Loganathan
International Journal of Engineering and Advanced Technology | VOL. 8
Ms D Preethi, et. al.Ms D Preethi ... Dr D Loganathan
30 Aug 2019
International Journal of Engineering and Advanced Technology | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-Modality Deep Network for Extreme Learned Image Compression

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence