Abstract

In this paper, a compressed membership problem for finite automata, both deterministic and non-deterministic, with compressed transition labels is studied. The compression is represented by straight-line programs (SLPs), i.e. context-free grammars generating exactly one string. A novel technique of dealing with SLPs is introduced: the SLPs are recompressed, so that substrings of the input text are encoded in SLPs labelling the transitions of the NFA (DFA) in the same way, as in the SLP representing the input text. To this end, the SLPs are locally decompressed and then recompressed in a uniform way. Furthermore, such recompression induces only small changes in the automaton, in particular, the size of the automaton remains polynomial. Using this technique it is shown that the compressed membership for NFA with compressed labels is in NP, thus confirming the conjecture of Plandowski and Rytter and extending the partial result of Lohrey and Mathissen; as it is already known, that this problem is NP-hard, we settle its exact computational complexity. Moreover, the same technique applied to the compressed membership for DFA with compressed labels yields that this problem is in P; for this problem, only trivial upper-bound PSPACE was known.

Highlights

  • 1.1 Compression and Straight-Line ProgrammesDue to ever-increasing amount of data, compression methods are widely applied in order to decrease the data’s size

  • The earlier work focused on the properties of strings described by straight-line programs (SLPs) and tried to use this knowledge in order to analyse the automaton and its input, hopefully this should result in an efficient algorithm

  • We focus on the SLPs, and not on the encoded strings

Read more

Summary

Compression and Straight-Line Programmes

Due to ever-increasing amount of data, compression methods are widely applied in order to decrease the data’s size. There is a large demand for algorithms working directly on the compressed representation of the data, without explicit decompression. A different approach is explored: for some applications and for most of theory-oriented considerations it would be useful to model the practical compression standard by a more mathematically well-founded method. This idea lay at the foundations of the notion of Straight-Line Programms (SLP), whose instances can be seen as context-free grammars generating exactly one string. The recent stateof-the-art efficient algorithms for pattern matching in LZ compressed text essentially use the reformulation of LZ methods in terms of SLPs [11]. For more information about the SLPs and their applications, please look at the recent survey of Lohrey [29]

Membership Problem
Our Results and Techniques
Similar Techniques
Other Applications of the Technique
Straight Line Programmes
Known Results
Basic Classifications and Outline of the Algorithm
Outline of the Algorithm
Invariants
Compression of Pairs
Compression of Non-crossing Pairs
Crossing Pair Compression
Blocks Compression
Compression of Noncrossing Blocks
Removing Crossing Blocks of a Letter
Running Time and Correctness
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.