Unified Architecture Adaptation for Compressed Domain Semantic Inference.

Zhihao Duan,Zhan Ma,Fengqing Zhu

doi:10.1109/tcsvt.2023.3240391

Abstract

Advances in both lossy image compression and semantic content understanding have been greatly fueled by deep learning techniques, yet these two tasks have been developed separately for the past decades. In this work, we address the problem of directly executing semantic inference from quantized latent features in the deep compressed domain without pixel reconstruction. Although different methods have been proposed for this problem setting, they either are restrictive to a specific architecture, or are sub-optimal in terms of compressed domain task accuracy. In contrast, we propose a lightweight, plug-and-play solution which is generally compliant with popular learned image coders and deep vision models, making it attractive to vast applications. Our method adapts prevalent pixel domain neural models that are deployed for various vision tasks to directly accept quantized latent features (other than pixels). We further suggest training the compressed domain model by transferring knowledge from its corresponding pixel domain counterpart. Experiments show that our method is compliant with popular learned image coders and vision task models. Under fair comparison, our approach outperforms a baseline method by a) more than 3% top-1 accuracy for compressed domain classification, and b) more than 7% mIoU for compressed domain semantic segmentation, at various data rates.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Unified Architecture Adaptation for Compressed Domain Semantic Inference.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems for Video Technology

Lead the way for us

Journal: IEEE Transactions on Circuits and Systems for Video Technology	Publication Date: Aug 1, 2023
Citations: 10

Similar Papers

A Video Saliency Detection Model in Compressed Domain
Yuming Fang ... Chia-Wen Lin
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 24
Yuming Fang, et. al.Yuming Fang ... Chia-Wen Lin
01 Jan 2014
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 24

Reconstruction-free Image Compression for Machine Vision via Knowledge Transfer
Hanyue Tu ... Li Li
ACM Transactions on Multimedia Computing, Communications, and Applications | VOL. -
Hanyue Tu, et. al.Hanyue Tu ... Li Li
17 Jul 2024
ACM Transactions on Multimedia Computing, Communications, and Applications | VOL. -

Examination of a tracking and detection method using compressed domain information
Erii Maekawa ... Satoshi Goto
-
Erii Maekawa, et. al.Erii Maekawa ... Satoshi Goto
01 Dec 2013
01 Dec 2013

<title>Image indexing and retrieval techniques: past, present, and next</title>
Jamshid Shanbehzadeh ... Charles A Bouman
-
Jamshid Shanbehzadeh, et. al.Jamshid Shanbehzadeh ... Charles A Bouman
23 Dec 1999
23 Dec 1999

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Unified Architecture Adaptation for Compressed Domain Semantic Inference.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems for Video Technology