Position heaps: A simple and dynamic text indexing data structure

Andrzej Ehrenfeucht, Ross M. McConnell, Nissa Osheim, Sung-Whan Woo

doi:10.1016/j.jda.2010.12.001

Open Access

Abstract

We address the problem of finding the locations of all instances of a string P in a text T, where preprocessing of T is allowed in order to facilitate the queries. Previous data structures for this problem include the suffix tree, the suffix array, and the compact DAWG. We modify a data structure called a sequence tree, which was proposed by Coffman and Eve (1970) [3] for hashing, and adapt it to the new problem. We can then produce a list of k occurrences of any string P in T in O(‖P‖ + k) time. Because of properties shared by suffixes of a text that are not shared by arbitrary hash keys, we can build the structure in O(‖T‖) time, which is much faster than Coffman and Eve's algorithm. These bounds are as good as those for the suffix tree, the suffix array, and the compact DAWG. The advantages are the elementary nature of some of the algorithms for constructing and using the data structure and the asymptotic bounds we can give for updating the data structure when the text is edited.
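To make the idea concrete, here is a minimal Python sketch of a position heap under one common formulation, which is an assumption here since the abstract does not spell out the construction: suffixes of T are inserted shortest first, and each insertion follows the existing path as far as it goes, then adds exactly one new node labeled with the suffix's starting position. The names `Node`, `build_position_heap`, and `find_occurrences` are hypothetical. This is a naive version: construction can take quadratic time, and the query verifies candidates directly against the text, so it does not achieve the O(‖T‖) and O(‖P‖ + k) bounds stated above, which rely on additional machinery not reproduced here.

```python
class Node:
    """One node of the (naive) position heap. Hypothetical helper class."""
    def __init__(self, pos):
        self.pos = pos        # start index in the text of the suffix that created this node
        self.children = {}    # maps a character to a child Node


def build_position_heap(text):
    """Insert suffixes from shortest to longest, adding one node per suffix."""
    root = Node(None)
    n = len(text)
    for i in range(n - 1, -1, -1):
        node, j = root, i
        # Follow the longest prefix of text[i:] that is already a path in the trie.
        while j < n and text[j] in node.children:
            node = node.children[text[j]]
            j += 1
        # The trie depth is always smaller than the current suffix length,
        # so j < n holds here and there is a next character to branch on.
        node.children[text[j]] = Node(i)
    return root


def find_occurrences(root, text, pattern):
    """Return sorted start positions of pattern in text (naive verification)."""
    if not pattern:
        return list(range(len(text)))
    hits = []
    node, matched = root, 0
    for ch in pattern:
        child = node.children.get(ch)
        if child is None:
            break
        node = child
        matched += 1
        # Nodes strictly above the end of the pattern are only candidates:
        # check them against the text directly.
        if matched < len(pattern) and text.startswith(pattern, node.pos):
            hits.append(node.pos)
    if matched == len(pattern):
        # The whole pattern is a path: every node in this subtree is an occurrence.
        stack = [node]
        while stack:
            cur = stack.pop()
            hits.append(cur.pos)
            stack.extend(cur.children.values())
    return sorted(hits)


heap = build_position_heap("abracadabra")
print(find_occurrences(heap, "abracadabra", "abra"))  # [0, 7]
```

The structure has exactly one node per text position, which is what keeps it simple; the direct `startswith` check on ancestor nodes is the part a linear-time query would have to replace.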

Journal: Journal of Discrete Algorithms
Publication Date: Dec 9, 2010
Citations: 54
License type: publisher-specific-oa

Similar Papers

Contracted Suffix Trees: A Simple and Dynamic Text Indexing Data Structure
Andrzej Ehrenfeucht ... Ross M. McConnell
01 Jan 2009

Solving All-Pairs Suffix Prefix – Theory and Practice
Maan Haj Rachid ... Qutaibah Malluhi
01 Jan 2015

The Position Heap of a Trie
Yuto Nakashima ... Hideo Bannai
01 Jan 2012

A time and space efficient data structure for string searching on large texts
Livio Colussi ... Alessia De Col
Information Processing Letters, vol. 58
01 Jun 1996
