Linear-size suffix tries

Maxime Crochemore,Chiara Epifanio,Roberto Grossi,Filippo Mignosi

doi:10.1016/j.tcs.2016.04.002

Maxime Crochemore, Chiara Epifanio + Show 2 more

Open Access

https://doi.org/10.1016/j.tcs.2016.04.002

Copy DOI

Abstract

Suffix trees are highly regarded data structures for text indexing and string algorithms [MCreight 76, Weiner 73]. For any given string w of length n=|w|, a suffix tree for w takes O(n) nodes and links. It is often presented as a compacted version of a suffix trie for w, where the latter is the trie (or digital search tree) built on the suffixes of w. Here the compaction process replaces each maximal chain of unary nodes with a single arc. For this, the suffix tree requires that the labels of its arcs are substrings encoded as pointers to w (or equivalent information). On the contrary, the arcs of the suffix trie are labeled by single symbols but there can be Θ(n2) nodes and links for suffix tries in the worst case because of their unary nodes. It is an interesting question if the suffix trie can be stored using O(n) nodes. We present the linear-size suffix trie, which guarantees O(n) nodes. We use a new technique for reducing the number of unary nodes to O(n), that stems from some results on antidictionaries. For instance, by using the linear-size suffix trie, we are able to check whether a pattern p of length m=|p| occurs in w in O(mlog⁡|Σ|) time and we can find the longest common substring of two strings w1 and w2 in O((|w1|+|w2|)log⁡|Σ|) time for an alphabet Σ.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Theoretical Computer Science	Publication Date: Apr 7, 2016
Citations: 10	License type: elsevier-specific: oa user license

R Discovery Prime

R Discovery Prime

Linear-size suffix tries

Abstract

Talk to us

Similar Papers

More From: Theoretical Computer Science

Lead the way for us

Similar Papers

An Efficient Index Data Structure with the Capabilities of Suffix Trees and Suffix Arrays for Alphabets of Non-negligible Size
Dong Kyue Kim ... Jeong Eun Jeon
-
Dong Kyue Kim, et. al.Dong Kyue Kim ... Jeong Eun Jeon
01 Jan 2004
01 Jan 2004

Linearized Suffix Tree: an Efficient Index Data Structure with the Capabilities of Suffix Trees and Suffix Arrays
Dong Kyue Kim ... Minhwan Kim
Algorithmica | VOL. 52
Dong Kyue Kim, et. al.Dong Kyue Kim ... Minhwan Kim
24 Oct 2007
Algorithmica | VOL. 52

Solving All-Pairs Suffix Prefix – Theory and Practice
Maan Haj Rachid ... Qutaibah Malluhi
-
Maan Haj Rachid, et. al.Maan Haj Rachid ... Qutaibah Malluhi
01 Jan 2015
01 Jan 2015

On the shape of the fringe of various types of random trees
Michael Drmota ... Alois Panholzer
Mathematical Methods in the Applied Sciences | VOL. 32
Michael Drmota, et. al.Michael Drmota ... Alois Panholzer
07 Nov 2008
Mathematical Methods in the Applied Sciences | VOL. 32

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Linear-size suffix tries

Abstract

Talk to us

Similar Papers

More From: Theoretical Computer Science