Approximate string matching using compressed suffix arrays

Trinh N.D. Huynh, Wing-Kai Hon, Tak-Wah Lam, Wing-Kin Sung

doi:10.1016/j.tcs.2005.11.022

Open Access

https://doi.org/10.1016/j.tcs.2005.11.022

Abstract

Let $T$ be a text of length $n$ and $P$ be a pattern of length $m$, both strings over a fixed finite alphabet $A$. The $k$-difference ($k$-mismatch, respectively) problem is to find all occurrences of $P$ in $T$ that have edit distance (Hamming distance, respectively) at most $k$ from $P$. In this paper we investigate a well-studied case in which $T$ is fixed and preprocessed into an indexing data structure so that any pattern query can be answered faster. We give a solution using an $O(n \log n)$-bit indexing data structure with $O(|A|^k m^k \cdot \max(k, \log n) + \mathrm{occ})$ query time, where $\mathrm{occ}$ is the number of occurrences. The best previous result requires an $O(n \log n)$-bit indexing data structure and gives $O(|A|^k m^{k+2} + \mathrm{occ})$ query time. Our solution also allows us to exploit compressed suffix arrays to reduce the indexing space to $O(n)$ bits, while increasing the query time by only an $O(\log n)$ factor.
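
To make the indexed $k$-mismatch problem concrete, here is a minimal Python sketch. It is not the algorithm of the paper: it uses a plain (uncompressed) suffix array built naively, enumerates every string within Hamming distance $k$ of $P$, and looks each one up by binary search. The function names (`build_suffix_array`, `exact_range`, `hamming_neighbours`, `k_mismatch`) are hypothetical and chosen only for this illustration. The number of enumerated variants grows roughly as $|A|^k m^k$, which gives some intuition for why that factor appears in the query bounds above.

```python
from itertools import combinations, product


def build_suffix_array(text):
    # Naive construction by sorting suffixes; adequate for a small
    # illustration, not for large texts.
    return sorted(range(len(text)), key=lambda i: text[i:])


def exact_range(text, sa, pattern):
    # Return the half-open range [lo, hi) of suffix-array entries whose
    # suffixes start with `pattern`, found by two binary searches.
    m = len(pattern)
    lo, hi = 0, len(sa)
    while lo < hi:  # first suffix whose m-prefix is >= pattern
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo
    hi = len(sa)
    while lo < hi:  # first suffix whose m-prefix is > pattern
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid
    return start, lo


def hamming_neighbours(pattern, k, alphabet):
    # Yield every string at Hamming distance at most k from `pattern`,
    # each exactly once.
    m = len(pattern)
    for d in range(min(k, m) + 1):
        for positions in combinations(range(m), d):
            for letters in product(alphabet, repeat=d):
                # Skip substitutions that leave a chosen position unchanged.
                if any(pattern[p] == c for p, c in zip(positions, letters)):
                    continue
                chars = list(pattern)
                for p, c in zip(positions, letters):
                    chars[p] = c
                yield "".join(chars)


def k_mismatch(text, pattern, k, alphabet):
    # Sorted start positions in `text` whose length-m window is within
    # Hamming distance k of `pattern`.
    sa = build_suffix_array(text)
    hits = set()
    for variant in hamming_neighbours(pattern, k, alphabet):
        lo, hi = exact_range(text, sa, variant)
        hits.update(sa[lo:hi])
    return sorted(hits)
```

For example, `k_mismatch("abracadabra", "abrc", 1, "abcdr")` returns `[0, 7]`: both occurrences of `abra` differ from the pattern in one position. Handling the $k$-difference (edit distance) case would additionally require insertions and deletions, which this sketch does not cover.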

Journal: Theoretical Computer Science
Publication Date: Dec 20, 2005
Citations: 56
License type: publisher-specific-oa
