Abstract
This paper proposes a high-performance noiseless data compression model that can compress strings generated by a binary Markov information source without prior knowledge of the source. The adaptive form of existing universal codes is described by a model with a separate sampler and encoder, and the learning structure and encoding process of each code are explained. This discussion clarifies intuitively why these codes, despite their asymptotic optimality, compress poorly at the start, and it points out room for improvement. The MDL criterion is then introduced into each encoder to improve performance while the sampler is kept fixed. The MDL criterion is an estimate of the minimum code length, including the cost of representing the parameters. Based on this criterion, the Markov order to be assumed for each symbol is determined adaptively. Since the same determination can be made at decoding time, unique decodability is guaranteed. The second half of the paper discusses more detailed techniques and demonstrates a remarkable improvement by computer simulation. It is also shown that the sampler of the proposed universal code is of high performance, realizing two kinds of encoders.
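The role of the MDL criterion in choosing a Markov order can be illustrated with a minimal sketch. It is not the paper's algorithm: it assumes the common two-part approximation in which each free parameter costs (1/2) log2 n bits, and it scores a whole string at once rather than symbol by symbol. The names `mdl_cost` and `select_order` are hypothetical, and the input is assumed to be a list of integers 0 and 1.

```python
import math
from collections import defaultdict

def mdl_cost(bits, order, start):
    """Approximate two-part description length (in bits) of bits[start:]
    under a binary Markov model of the given order (requires start >= order).

    Assumption: each of the 2**order free parameters costs 0.5 * log2(n) bits.
    """
    counts = defaultdict(lambda: [0, 0])  # context -> [count of 0, count of 1]
    for i in range(start, len(bits)):
        context = tuple(bits[i - order:i])
        counts[context][bits[i]] += 1

    n = len(bits) - start
    # Data cost: empirical conditional entropy summed over all coded symbols.
    data_bits = 0.0
    for c0, c1 in counts.values():
        total = c0 + c1
        for c in (c0, c1):
            if c:
                data_bits -= c * math.log2(c / total)

    # Parameter cost: grows with the number of contexts, penalizing high orders.
    param_bits = 0.5 * (2 ** order) * math.log2(n) if n > 1 else 0.0
    return data_bits + param_bits

def select_order(bits, max_order):
    """Return the order in 0..max_order with the smallest approximate MDL cost.

    All orders are scored on the same symbols (from index max_order onward)
    so that the costs are comparable.
    """
    return min(range(max_order + 1),
               key=lambda k: mdl_cost(bits, k, max_order))

if __name__ == "__main__":
    alternating = [0, 1] * 100
    print(select_order(alternating, 2))
```

For an alternating string, order 0 pays roughly one bit per symbol, while order 1 predicts every symbol and pays only the parameter cost, so the sketch prefers order 1 over both order 0 and order 2. Applying `select_order` only to the already-processed prefix would let a decoder repeat the same choice, which is the property the abstract relies on for unique decodability.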
More From: Electronics and Communications in Japan (Part I: Communications)