Wavelet Trees: From Theory to Practice

Roberto Grossi,Jeffrey Scott Vitter,Bojian Xu

doi:10.1109/ccp.2011.16

Abstract

The \emph{wavelet tree} data structure is a space-efficient technique for rank and select queries that generalizes from binary characters to an arbitrary multicharacter alphabet. It has become a key tool in modern full-text indexing and data compression because of its capabilities in compressing, indexing, and searching. We present a comparative study of its practical performance regarding a wide range of options on the dimensions of different coding schemes and tree shapes. Our results are both theoretical and experimental: (1)~We show that the run-length $\delta$ coding size of wavelet trees achieves the 0-order empirical entropy size of the original string with leading constant 1, when the string's 0-order empirical entropy is asymptotically less than the logarithm of the alphabet size. This result complements the previous works that are dedicated to analyzing run-length $\gamma$-encoded wavelet trees. It also reveals the scenarios when run-length $\delta$ encoding becomes practical. (2)~We introduce a full generic package of wavelet trees for a wide range of options on the dimensions of coding schemes and tree shapes. Our experimental study reveals the practical performance of the various modifications.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Wavelet Trees: From Theory to Practice

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

The myriad virtues of Wavelet Trees
Paolo Ferragina ... Giovanni Manzini
Information and Computation | VOL. 207
Paolo Ferragina, et. al.Paolo Ferragina ... Giovanni Manzini
29 Jan 2009
Information and Computation | VOL. 207

Canonical Huffman code based full-text index
Yi Zhang ... Yanchun Liang
Progress in Natural Science | VOL. 18
Yi Zhang, et. al.Yi Zhang ... Yanchun Liang
28 Jan 2008
Progress in Natural Science | VOL. 18

Wavelet Tree ensembles with Machine Learning and its classification
Neha Katiyar ... Arun Kumar Yadav
Journal of Physics: Conference Series | VOL. 1998
Neha Katiyar, et. al.Neha Katiyar ... Arun Kumar Yadav
01 Aug 2021
Journal of Physics: Conference Series | VOL. 1998

Rank and select revisited and extended
Veli Mäkinen ... Gonzalo Navarro
Theoretical Computer Science | VOL. 387
Veli Mäkinen, et. al.Veli Mäkinen ... Gonzalo Navarro
27 Jul 2007
Theoretical Computer Science | VOL. 387

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Wavelet Trees: From Theory to Practice

Abstract

Talk to us

Similar Papers