String indexing for top-k close consecutive occurrences

Philip Bille, Inge Li Gørtz, Max Rishøj Pedersen, Eva Rotenberg, Teresa Anna Steiner

doi:10.1016/j.tcs.2022.06.004

Open Access

https://doi.org/10.1016/j.tcs.2022.06.004

Abstract

The classic string indexing problem is to preprocess a string S into a compact data structure that supports efficient subsequent pattern matching queries, that is, given a pattern string P, report all occurrences of P within S. In this paper, we study a basic and natural extension of string indexing called the string indexing for top-k close consecutive occurrences problem (Sitcco). Here, a consecutive occurrence is a pair (i,j), i<j, such that P occurs at positions i and j in S and there is no occurrence of P between i and j, and their distance is defined as j−i. Given a pattern P and a parameter k, the goal is to report the top-k consecutive occurrences of P in S of minimal distance. The challenge is to compactly represent S while supporting queries in time close to the length of P and k. We give three time-space trade-offs for the problem. Let n be the length of S, m the length of P, and $\epsilon \in (0,1]$. Our first result achieves $O(n \log n)$ space and optimal query time of $O(m+k)$. Our second and third results achieve linear space and query times either $O(m + k^{1+\epsilon})$ or $O(m + k \log^{1+\epsilon} n)$. Along the way, we develop several techniques of independent interest, including a new translation of the problem into a line segment intersection problem and a new recursive clustering technique for trees.
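To make the problem definition concrete, the following is a minimal brute-force sketch of a single query. It is not the indexing data structure from the paper: it scans S directly on every query, whereas the paper's results come from preprocessing S. The function names `find_occurrences` and `top_k_close_consecutive` are hypothetical, and breaking ties by position in S is an assumption, since the abstract does not specify a tie-breaking rule.

```python
def find_occurrences(s: str, p: str) -> list[int]:
    """Return all start positions of p in s, in ascending order.

    Overlapping occurrences are included, because the search restarts
    one position after each match rather than after the end of the match.
    """
    if not p:
        raise ValueError("pattern must be non-empty")
    positions = []
    i = s.find(p)
    while i != -1:
        positions.append(i)
        i = s.find(p, i + 1)
    return positions


def top_k_close_consecutive(s: str, p: str, k: int) -> list[tuple[int, int]]:
    """Report up to k consecutive occurrences (i, j) of p in s with smallest j - i.

    Adjacent entries of the sorted occurrence list are exactly the pairs
    with no occurrence of p strictly between them.
    """
    occ = find_occurrences(s, p)
    pairs = list(zip(occ, occ[1:]))
    # sorted() is stable, so pairs with equal distance stay in left-to-right order
    # (an assumed tie-breaking rule, not one taken from the paper).
    return sorted(pairs, key=lambda pair: pair[1] - pair[0])[:k]
```

For example, with S = "abaababaab" and P = "ab", the pattern occurs at positions 0, 3, 5 and 8, so the consecutive occurrences are (0,3), (3,5) and (5,8) with distances 3, 2 and 3. A query with k = 1 then reports (3,5). This baseline rescans S on every query, which is what the trade-offs in the abstract are designed to avoid.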

Journal: Theoretical Computer Science	Publication Date: Jun 3, 2022
Citations: 2	License type: cc-by

Similar Papers

Gapped Indexing for Consecutive Occurrences
Philip Bille ... Teresa Anna Steiner
Algorithmica | VOL. 85
20 Oct 2022

Gapped indexing for consecutive occurrences
01 Jan 2020

Deterministic Indexing for Packed Strings
30 Aug 2017

String Indexing with Compressed Patterns
01 Jan 2020
