Reconstruction-free Image Compression for Machine Vision via Knowledge Transfer

Hanyue Tu, Houqiang Li, Wengang Zhou, Li Li

doi:10.1145/3678471

Abstract

Reconstruction-free image compression for machine vision aims to perform machine vision tasks directly on compressed-domain representations instead of reconstructed images. Existing reports have validated the feasibility of compressed-domain machine vision. However, we observe that when using recent learned compression models, the performance gap between compressed-domain and pixel-domain vision tasks is still large because compressed-domain inputs lack some of the natural inductive biases that pixel-domain convolutional neural networks rely on. In this paper, we attempt to address this problem by transferring knowledge from the pixel domain to the compressed domain. A knowledge transfer loss defined at both the output level and the feature level is proposed to narrow the gap between the two domains. In addition, we modify neural networks designed for pixel-domain vision tasks to better suit compressed-domain inputs. Experimental results on several machine vision tasks show that the proposed method significantly improves the accuracy of compressed-domain vision tasks and even outperforms learning on reconstructed images, while avoiding the computational cost of image reconstruction.
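
The abstract does not state the exact form of the knowledge transfer loss. As a minimal sketch only, assume the output-level term is a temperature-softened KL divergence between the pixel-domain (teacher) and compressed-domain (student) predictions, and the feature-level term is a mean squared error between feature vectors of equal length. The function names, the weights alpha and beta, and the temperature are hypothetical and are not taken from the paper.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def output_level_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on softened class probabilities (assumed form)."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)


def feature_level_loss(teacher_feat, student_feat):
    """Mean squared error between two flattened feature vectors (assumed form)."""
    if len(teacher_feat) != len(student_feat):
        raise ValueError("feature vectors must have the same length")
    squared = [(t - s) ** 2 for t, s in zip(teacher_feat, student_feat)]
    return sum(squared) / len(squared)


def knowledge_transfer_loss(teacher_logits, student_logits,
                            teacher_feat, student_feat,
                            alpha=1.0, beta=1.0, temperature=2.0):
    """Weighted sum of the two terms; alpha and beta are hypothetical weights."""
    out_term = output_level_loss(teacher_logits, student_logits, temperature)
    feat_term = feature_level_loss(teacher_feat, student_feat)
    return alpha * out_term + beta * feat_term
```

In this sketch the teacher would be a network fed pixel-domain images and the student a network fed compressed-domain representations. Both terms are zero when the student matches the teacher exactly and grow as their outputs or features diverge.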

More From: ACM Transactions on Multimedia Computing, Communications, and Applications

Similar Papers

Improving Multiple Machine Vision Tasks in the Compressed Domain
Jinming Liu ... Heming Sun
21 Aug 2022

Towards Coding for Human and Machine Vision: Scalable Face Image Coding
Shuai Yang ... Ling-Yu Duan
IEEE Transactions on Multimedia | VOL. 23
01 Jan 2020

Video Coding for Machines: Compact Visual Representation Compression for Intelligent Collaborative Analytics
Wenhan Yang ... Ling-Yu Duan
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 46
01 Jul 2024

Slimmable Multi-Task Image Compression for Human and Machine Vision
Jiangzhong Cao ... Huan Zhang
IEEE Access | VOL. 11
01 Jan 2023
