In a lossless compression system with target lengths, a compressor 𝒞 maps an integer m and a binary string x to an m -bit code p , and if m is sufficiently large, a decompressor 𝒟 reconstructs x from p . We call a pair ( m,x ) achievable for (𝒞,𝒟) if this reconstruction is successful. We introduce the notion of an optimal compressor 𝒞 opt by the following universality property: For any compressor-decompressor pair (𝒞,𝒟), there exists a decompressor 𝒟 ′ such that if (m,x) is achievable for (𝒞,𝒟), then ( m + Δ , x ) is achievable for (𝒞 opt , 𝒟 ′ ), where Δ is some small value called the overhead. We show that there exists an optimal compressor that has only polylogarithmic overhead and works in probabilistic polynomial time. Differently said, for any pair (𝒞,𝒟), no matter how slow 𝒞 is, or even if 𝒞 is non-computable, 𝒞 opt is a fixed compressor that in polynomial time produces codes almost as short as those of 𝒞. The cost is that the corresponding decompressor is slower. We also show that each such optimal compressor can be used for distributed compression, in which case it can achieve optimal compression rates as given in the Slepian–Wolf theorem and even for the Kolmogorov complexity variant of this theorem.
Read full abstract