Towards End-to-End Image Compression and Analysis with Transformers

Yuanchao Bai, Wen Gao, Xu Yang, Junjun Jiang, Xianming Liu, Xiangyang Ji, Yaowei Wang

doi:10.1609/aaai.v36i1.19884

Abstract

We propose an end-to-end image compression and analysis model with Transformers, targeting cloud-based image classification applications. Instead of placing an existing Transformer-based image classification model directly after an image codec, we redesign the Vision Transformer (ViT) model to perform image classification from the compressed features and to facilitate image compression with the long-term information from the Transformer. Specifically, we first replace the patchify stem (i.e., image splitting and embedding) of the ViT model with a lightweight image encoder modelled by a convolutional neural network. The compressed features generated by the image encoder carry a convolutional inductive bias and are fed to the Transformer for image classification, bypassing image reconstruction. Meanwhile, we propose a feature aggregation module that fuses the compressed features with selected intermediate features of the Transformer and feeds the aggregated features to a deconvolutional neural network for image reconstruction. The aggregated features obtain long-term information from the self-attention mechanism of the Transformer, which improves compression performance. Finally, the rate-distortion-accuracy optimization problem is solved with a two-step training strategy. Experimental results demonstrate the effectiveness of the proposed model on both the image compression and the image classification tasks.
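The abstract names a rate-distortion-accuracy objective but does not give its form. As a rough illustration only, the sketch below assumes a weighted sum of an estimated bit rate, a mean squared reconstruction error, and a classification cross-entropy. The function names, the weights `lambda_d` and `lambda_a`, and the choice of MSE and cross-entropy are assumptions made for this example, not details taken from the paper.

```python
import math


def bits_per_pixel(likelihoods, num_pixels):
    """Estimated rate: summed -log2 likelihood of the coded symbols, per pixel."""
    return sum(-math.log2(p) for p in likelihoods) / num_pixels


def mse(original, reconstructed):
    """Distortion: mean squared error between two equal-length pixel lists."""
    return sum((a - b) ** 2 for a, b in zip(original, reconstructed)) / len(original)


def cross_entropy(logits, label):
    """Accuracy term: softmax cross-entropy for a single example (in nats)."""
    m = max(logits)  # subtract the max for numerical stability
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[label]


def rda_loss(likelihoods, original, reconstructed, logits, label,
             lambda_d=1.0, lambda_a=1.0):
    """Hypothetical joint objective: rate + lambda_d * distortion + lambda_a * CE."""
    rate = bits_per_pixel(likelihoods, len(original))
    distortion = mse(original, reconstructed)
    accuracy_term = cross_entropy(logits, label)
    return rate + lambda_d * distortion + lambda_a * accuracy_term


if __name__ == "__main__":
    # Toy values: 4 pixels, 2 coded symbols, 2 classes.
    loss = rda_loss(
        likelihoods=[0.5, 0.25],
        original=[0, 1, 0, 1],
        reconstructed=[0, 1, 0, 0],
        logits=[0.0, 0.0],
        label=0,
        lambda_d=2.0,
        lambda_a=1.0,
    )
    print(loss)
```

Under this assumed form, raising `lambda_d` favours reconstruction quality and raising `lambda_a` favours classification accuracy, each at the cost of a higher bit rate. How the paper's two-step training strategy schedules these terms is not stated in the abstract, so it is not modelled here.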

Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Publication Date: Jun 28, 2022
Citations: 25
