Lempel–Ziv Factorization Using Less Time &amp; Space

Gang Chen,W F Smyth,Simon J Puglisi

doi:10.1007/s11786-007-0024-4

Abstract

For 30 years the Lempel–Ziv factorization LZ x of a string x = x[1..n] has been a fundamental data structure of string processing, especially valuable for string compression and for computing all the repetitions (runs) in x. Traditionally the standard method for computing LZ x was based on Θ(n)-time (or, depending on the measure used, O(n log n)-time) processing of the suffix tree ST x of x. Recently Abouelhoda et al. proposed an efficient Lempel–Ziv factorization algorithm based on an “enhanced” suffix array – that is, a suffix array SA x together with supporting data structures, principally an “interval tree”. In this paper we introduce a collection of fast space-efficient algorithms for LZ factorization, also based on suffix arrays, that in theory as well as in many practical circumstances are superior to those previously proposed; one family out of this collection achieves true Θ(n)-time alphabet-independent processing in the worst case by avoiding tree structures altogether.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Lempel–Ziv Factorization Using Less Time & Space

Abstract

Talk to us

Similar Papers

More From: Mathematics in Computer Science

Lead the way for us

Journal: Mathematics in Computer Science	Publication Date: Apr 11, 2008
Citations: 109

Similar Papers

Fast and Practical Algorithms for Computing All the Runs in a String
Gang Chen ... W F Smyth
-
Gang Chen, et. al.Gang Chen ... W F Smyth
09 Jul 2007
09 Jul 2007

Parallel distributed memory construction of suffix and longest common prefix arrays
Patrick Flick ... Srinivas Aluru
-
Patrick Flick, et. al.Patrick Flick ... Srinivas Aluru
15 Nov 2015
15 Nov 2015

An Efficient Index Data Structure with the Capabilities of Suffix Trees and Suffix Arrays for Alphabets of Non-negligible Size
Dong Kyue Kim ... Jeong Eun Jeon
-
Dong Kyue Kim, et. al.Dong Kyue Kim ... Jeong Eun Jeon
01 Jan 2004
01 Jan 2004

Linearized Suffix Tree: an Efficient Index Data Structure with the Capabilities of Suffix Trees and Suffix Arrays
Dong Kyue Kim ... Minhwan Kim
Algorithmica | VOL. 52
Dong Kyue Kim, et. al.Dong Kyue Kim ... Minhwan Kim
24 Oct 2007
Algorithmica | VOL. 52

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Lempel–Ziv Factorization Using Less Time &amp; Space

Abstract

Talk to us

Similar Papers

More From: Mathematics in Computer Science

Lempel–Ziv Factorization Using Less Time & Space