On Approximate Jumbled Pattern Matching in Strings

Péter Burcsi,Zsuzsanna Lipták,Ferdinando Cicalese,Gabriele Fici

doi:10.1007/s00224-011-9344-5

Abstract

Given a string s, the Parikh vector of s, denoted p(s), counts the multiplicity of each character in s. Searching for a match of a Parikh vector q in the text s requires finding a substring t of s with p(t)=q. This can be viewed as the task of finding a jumbled (permuted) version of a query pattern, hence the term Jumbled Pattern Matching. We present several algorithms for the approximate version of the problem: Given a string s and two Parikh vectors u,v (the query bounds), find all maximal occurrences in s of some Parikh vector q such that u≤q≤v. This definition encompasses several natural versions of approximate Parikh vector search. We present an algorithm solving this problem in sub-linear expected time using a wavelet tree of s, which can be computed in time O(n) in a preprocessing phase. We then discuss a Scrabble-like variation of the problem, in which a weight function on the letters of s is given and one has to find all occurrences in s of a substring t with maximum weight having Parikh vector p(t)≤v. For the case of a binary alphabet, we present an algorithm which solves the decision version of the Approximate Jumbled Pattern Matching problem in constant time, by indexing the string in subquadratic time.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

On Approximate Jumbled Pattern Matching in Strings

Abstract

Talk to us

Similar Papers

More From: Theory of Computing Systems

Lead the way for us

Journal: Theory of Computing Systems	Publication Date: Jun 11, 2011
Citations: 67

Similar Papers

A Method for the Design of Petri Net Controller Enforcing General Linear Constraints
Shou-Guang Wang
Journal of Software | VOL. 16
Shou-Guang WangShou-Guang Wang
01 Jan 2004
Journal of Software | VOL. 16

On Prefix Normal Words
Gabriele Fici ... Zsuzsanna Lipták
-
Gabriele Fici, et. al.Gabriele Fici ... Zsuzsanna Lipták
01 Jan 2010
01 Jan 2010

On Table Arrangements, Scrabble Freaks, and Jumbled Pattern Matching
Péter Burcsi ... Ferdinando Cicalese
-
Péter Burcsi, et. al.Péter Burcsi ... Ferdinando Cicalese
01 Jan 2009
01 Jan 2009

On prefix normal words and prefix normal forms
Péter Burcsi ... Joe Sawada
Theoretical Computer Science | VOL. 659
Péter Burcsi, et. al.Péter Burcsi ... Joe Sawada
14 Nov 2016
Theoretical Computer Science | VOL. 659

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

On Approximate Jumbled Pattern Matching in Strings

Abstract

Talk to us

Similar Papers

More From: Theory of Computing Systems