Scalable Face Image Coding via StyleGAN Prior: Toward Compression for Human-Machine Collaborative Vision.

Qi Mao,Siwei Ma,Shiqi Wang,Libiao Jin,Ruijie Chen,Meng Wang,Chongyu Wang

doi:10.1109/tip.2023.3343912

Abstract

The accelerated proliferation of visual content and the rapid development of machine vision technologies bring significant challenges in delivering visual data on a gigantic scale, which shall be effectively represented to satisfy both human and machine requirements. In this work, we investigate how hierarchical representations derived from the advanced generative prior facilitate constructing an efficient scalable coding paradigm for human-machine collaborative vision. Our key insight is that by exploiting the StyleGAN prior, we can learn three-layered representations encoding hierarchical semantics, which are elaborately designed into the basic, middle, and enhanced layers, supporting machine intelligence and human visual perception in a progressive fashion. With the aim of achieving efficient compression, we propose the layer-wise scalable entropy transformer to reduce the redundancy between layers. Based on the multi-task scalable rate-distortion objective, the proposed scheme is jointly optimized to achieve optimal machine analysis performance, human perception experience, and compression ratio. We validate the proposed paradigm's feasibility in face image compression. Extensive qualitative and quantitative experimental results demonstrate the superiority of the proposed paradigm over the latest compression standard Versatile Video Coding (VVC) in terms of both machine analysis as well as human perception at extremely low bitrates (< 0.01 bpp), offering new insights for human-machine collaborative compression.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Scalable Face Image Coding via StyleGAN Prior: Toward Compression for Human-Machine Collaborative Vision.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Image Processing

Lead the way for us

Journal: IEEE Transactions on Image Processing	Publication Date: Jan 1, 2024
Citations: 1

Similar Papers

Analyzing the Impact of Loss Functions on Dehazing Effectiveness and Unveiling the Discrepancy between Quantitative and Qualitative Results
Ali Murtaza ... Ahmad ‘Athif Mohd Fauzi
Journal of Advanced Research in Applied Sciences and Engineering Technology | VOL. 44
Ali Murtaza, et. al. Ali Murtaza ... Ahmad ‘Athif Mohd Fauzi
26 Apr 2024
Journal of Advanced Research in Applied Sciences and Engineering Technology | VOL. 44

Video quality evaluation and testing verification of H.264, HEVC, VVC and EVC video compression standards
Shreyanka Subbarayappa ... K R Rao
IOP Conference Series: Materials Science and Engineering | VOL. 1045
Shreyanka Subbarayappa, et. al.Shreyanka Subbarayappa ... K R Rao
01 Feb 2021
IOP Conference Series: Materials Science and Engineering | VOL. 1045

Statistical analysis of the QTMT structure: intra mode decision
Naima Zouidi ... Nouri Masmoudi
-
Naima Zouidi, et. al.Naima Zouidi ... Nouri Masmoudi
09 Dec 2020
09 Dec 2020

Performance Comparison of Weak Filtering in HEVC and VVC
Junghyun Lee ... Jechang Jeong
Electronics | VOL. 9
Junghyun Lee, et. al.Junghyun Lee ... Jechang Jeong
09 Jun 2020
Electronics | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Scalable Face Image Coding via StyleGAN Prior: Toward Compression for Human-Machine Collaborative Vision.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Image Processing