Abstract
An entropy coder takes as input a sequence of symbol identifiers over some specified alphabet and represents that sequence as a bitstring using as few bits as possible, typically assuming that the elements of the sequence are independent of each other. Previous entropy coding methods include the well-known Huffman and arithmetic approaches. Here we examine the newer asymmetric numeral systems (ANS) technique for entropy coding and develop mechanisms that allow it to be used efficiently when the source alphabet is large, containing thousands or millions of symbols. In particular, we examine different ways in which probability distributions over large alphabets can be approximated, and in doing so derive techniques that allow the ANS mechanism to be extended to support large-alphabet entropy coding. As well as providing a full description of ANS, we present detailed experiments using several different types of input, including data streams arising as typical output from the modeling stages of text compression software, and compare the proposed ANS variants with Huffman and arithmetic coding baselines, measuring both compression effectiveness and encoding and decoding throughput. We demonstrate that in applications in which semi-static compression is appropriate, ANS-based coders can provide an excellent balance between compression effectiveness and speed, even when the alphabet is large.
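To make the abstract's description concrete, the following is a minimal sketch of the range variant of ANS (rANS) in Python. It is an illustrative toy, not the implementation evaluated in the paper: it uses an unbounded big-integer state rather than the renormalized fixed-width state a practical coder requires, and a linear symbol search where a large-alphabet coder would use a lookup table or binary search. The names freq, cum, and M (symbol frequencies, cumulative frequencies, and their total) are assumptions introduced for this example.

# Minimal rANS sketch: unbounded big-integer state, no renormalization.
from itertools import accumulate

def build_tables(freq):
    # cum[s] = sum(freq[:s]); M = total frequency mass.
    cum = [0] + list(accumulate(freq))
    return cum, cum[-1]

def encode(symbols, freq):
    # Fold each symbol into the state: x -> (x // f) * M + cum[s] + (x % f).
    cum, M = build_tables(freq)
    x = 1  # initial state
    for s in symbols:
        f = freq[s]
        x = (x // f) * M + cum[s] + (x % f)
    return x

def decode(x, freq, n):
    # Invert the encoder; symbols emerge last-in first-out.
    cum, M = build_tables(freq)
    out = []
    for _ in range(n):
        slot = x % M
        # Linear search for the symbol whose slot range contains 'slot'
        # (a production coder would use a table or binary search here).
        s = next(i for i in range(len(freq)) if cum[i] <= slot < cum[i + 1])
        out.append(s)
        x = freq[s] * (x // M) + (slot - cum[s])
    return out[::-1]  # reverse to recover encoding order

freq = [3, 3, 2]              # toy three-symbol alphabet, M = 8
msg = [0, 1, 2, 0, 0, 1]
x = encode(msg, freq)
assert decode(x, freq, len(msg)) == msg
print(x.bit_length(), "bits of state for", len(msg), "symbols")

Each encoding step multiplies the state by roughly M / freq[s], so the state grows by about log2(M / freq[s]) bits per symbol, matching the symbol's information content; the stack-like (LIFO) decoding order is the characteristic difference between ANS and arithmetic coding.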