Abstract

Image coding is one of the most fundamental techniques and is widely used in image/video processing and multimedia communications. Current image coding methods are mainly human-oriented, and their visual quality is often unsatisfactory, especially at low bitrates. Moreover, the recent emergence of machine vision tasks goes beyond the scope of current coding. With these considerations, we propose a sketch-assisted face image coding framework for human and machine vision based on a joint training approach. In the proposed approach, we design a new feature representation, the color sketch, which aims to satisfy both the low-frequency features required by human vision and the high-frequency features required by machine analysis. We then present a novel end-to-end image codec framework with joint training that consists of three modules: an image-to-image translation module, a coding module, and a two-stage reconstruction module. Specifically, the input image is first translated into an edge map, with the Canny edge map serving as an auxiliary label, so that only the structure information is preserved. Afterward, backpropagation from the reconstruction module guides the edge map to gain or shed information through joint training, which yields the color sketch. The generated sketch is then compressed into a bitstream and decompressed back into a sketch by the coding module. Finally, the decompressed sketch is reconstructed to support the machine and human tasks, respectively. In this way, the color sketch is designed to bridge the gap between human and machine vision, and the joint training strategy helps to adjust the low-frequency information in the sketch. Experimental results on challenging datasets demonstrate that our proposed algorithm offers 40.9%-86.6% bitrate savings for machine vision and is comparable to state-of-the-art image coding methods for human vision.
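To make the pipeline described above concrete, the following is a minimal, hypothetical PyTorch sketch of the three-module chain (translation, coding, two-stage reconstruction) and the Canny auxiliary label. The names (SketchCodecPipeline, canny_edge_label), the use of OpenCV, and the Canny thresholds are assumptions for illustration, not the authors' implementation.

```python
# Illustrative sketch only; module internals, names, and thresholds are assumptions.
import cv2
import numpy as np
import torch
import torch.nn as nn

def canny_edge_label(image_bgr: np.ndarray) -> torch.Tensor:
    """Auxiliary Canny edge label used to supervise the image-to-sketch translation."""
    gray = cv2.cvtColor(image_bgr, cv2.COLOR_BGR2GRAY)
    edges = cv2.Canny(gray, 100, 200)  # thresholds are placeholders
    return torch.from_numpy(edges / 255.0).float().unsqueeze(0)  # 1 x H x W

class SketchCodecPipeline(nn.Module):
    """End-to-end chain: image-to-sketch translation -> coding -> reconstruction."""
    def __init__(self, translator: nn.Module, codec: nn.Module, reconstructor: nn.Module):
        super().__init__()
        self.translator = translator        # image -> color sketch
        self.codec = codec                  # sketch -> bitstream -> decoded sketch
        self.reconstructor = reconstructor  # decoded sketch -> human / machine outputs

    def forward(self, image: torch.Tensor):
        sketch = self.translator(image)
        decoded_sketch = self.codec(sketch)
        human_out, machine_out = self.reconstructor(decoded_sketch)
        # Joint training: gradients from the reconstruction losses flow back through
        # the codec into the translator, shaping how much information the sketch keeps.
        return sketch, decoded_sketch, human_out, machine_out
```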
