Let Sleeping Files Lie: Pattern Matching in Z-Compressed Files

Amihood Amir,Gary Benson,Martin Farach

doi:10.1006/jcss.1996.0023

Abstract

The current explosion of stored information necessitates a new model of pattern matching, that ofcompressed matching. In this model one tries to find all occurrences of a pattern in a compressed text in time proportional to the compressed text size,i.e., without decompressing the text. The most effective general purpose compression algorithms areadaptive, in that the text represented by each compression symbol is determined dynamically by the data. As a result, the encoding of a substring depends on its location. Thus the same substring may “look different” every time it appears in the compressed text. In this paper we consider pattern matching without decompression in the UNIX Z-compression. This is a variant of the Lempel–Ziv adaptive compression scheme. Ifnis the length of thecompressedtext andmis the length of the pattern, our algorithms find the first pattern occurrence in timeO(n+m2) orO(nlogm+m). We also introduce a new criterion to measure compressed matching algorithms, that ofextra space. We show how to modify our algorithms to achieve a trade-off between the amount of extra space used and the algorithm's time complexity.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Computer and System Sciences	Publication Date: Apr 1, 1996
Citations: 160	License type: publisher-specific-oa

R Discovery Prime

R Discovery Prime

Let Sleeping Files Lie: Pattern Matching in Z-Compressed Files

Abstract

Talk to us

Similar Papers

More From: Journal of Computer and System Sciences

Lead the way for us

Similar Papers

Let sleeping files lie: pattern matching in Z-compressed files
...
-
, et. al. ...
23 Jan 1994
23 Jan 1994

Analyzing the performance differences between pattern matching and compressed pattern matching on texts
Cihat Erdogan ... Banu Diri
-
Cihat Erdogan, et. al.Cihat Erdogan ... Banu Diri
01 Nov 2013
01 Nov 2013

Inplace 2D matching in compressed images
...
-
, et. al. ...
12 Jan 2003
12 Jan 2003

A matching algorithms based on the depth first search for the general graph
Chengcheng Yu ... Zhonge Sheng
-
Chengcheng Yu, et. al.Chengcheng Yu ... Zhonge Sheng
01 Jan 2014
01 Jan 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Let Sleeping Files Lie: Pattern Matching in Z-Compressed Files

Abstract

Talk to us

Similar Papers

More From: Journal of Computer and System Sciences