Abstract
This paper exploits a modified genetic programming (GP) approach for solving the data compression problem. In fact, the typical GP algorithm in which a candidate solution is expressed as a tree rather than a bit string, fails to solve that problem since it does not guarantee a one to one correspondence between a particular symbol and the corresponding codeword during subtree exchange operations. The nature of the problem requires generating one, and only one, codeword for each symbol of the underlying text. In the proposed scheme, the authors introduced three new operators, namely, insertion, two-level mutation and modified crossover. Accordingly, a modified version of GP is presented and applied on different data texts to validate the proposed approach. The developed algorithm can provide optimum codes since its final solution reaches Huffman tree. Moreover, it makes use of GP not only to allow optimum compression ratio but also to provide adaptive compression implementation. The adaptation is achieved so that the selection of the codebook depends on the nature of the input text. The proposed compression scheme is written in C++ and is implemented on different text types under various operational conditions. Accordingly, the algorithm performance has been measured and evaluated.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: International Journal of Information and Coding Theory
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.