A review on compressed pattern matching

Surya Prakash Mishra, Col Gurmit Singh, Rajesh Prasad

doi:10.1016/j.pisc.2016.06.071

Abstract

Compressed pattern matching (CPM) is the task of locating all occurrences of a pattern (or a set of patterns) within compressed text. The pattern itself may or may not be compressed. CPM is very useful for handling large volumes of data, especially over a network. It has many applications, including computational biology, where it helps find similar trends in DNA sequences, as well as intrusion detection over networks and big data analytics. Researchers have proposed various solutions in which the pattern is matched directly over the uncompressed text. Such solutions require a lot of space and time when handling big data. Many efficient solutions have been proposed for compression, but very few exist for pattern matching over compressed text. Given that data sizes are growing exponentially, CPM has become a desirable task. This paper presents a critical review of recent techniques for compressed pattern matching. The techniques covered include word-based Huffman codes, word-based tagged codes, and wavelet-tree-based indexing. We present a comparative analysis of these techniques and highlight their advantages and disadvantages.
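As a rough illustration of matching directly over compressed text, the sketch below uses a simple word-based code in which the high bit of a byte tags the start of each codeword. It is an assumption-laden toy, not the word-based Huffman or word-based tagged code schemes reviewed in the paper; the names `encode_rank`, `build_codebook`, `compress`, and `find_occurrences` are hypothetical.

```python
from collections import Counter

TAG = 0x80  # high bit marks the first byte of every codeword (illustrative choice)

def encode_rank(rank):
    """Encode a frequency rank as base-128 digits; only the first byte is tagged."""
    digits = [rank & 0x7F]
    rank >>= 7
    while rank:
        digits.append(rank & 0x7F)
        rank >>= 7
    digits.reverse()
    digits[0] |= TAG
    return bytes(digits)

def build_codebook(words):
    """Give more frequent words smaller ranks, hence shorter codewords."""
    ranked = [w for w, _ in Counter(words).most_common()]
    return {w: encode_rank(i) for i, w in enumerate(ranked)}

def compress(words, codebook):
    """Concatenate the codewords of a word sequence."""
    return b"".join(codebook[w] for w in words)

def find_occurrences(compressed, pattern_words, codebook):
    """Return byte offsets where the word pattern occurs, without decompressing."""
    if not pattern_words or any(w not in codebook for w in pattern_words):
        return []
    needle = compress(pattern_words, codebook)  # compress the pattern the same way
    hits = []
    start = compressed.find(needle)
    while start != -1:
        end = start + len(needle)
        # The needle begins with a tagged byte, so a match always starts on a
        # codeword boundary. Accept it only if it also ends on one, i.e. the
        # next byte is tagged or the text ends; otherwise it is a false match
        # against the prefix of a longer codeword.
        if end == len(compressed) or compressed[end] & TAG:
            hits.append(start)
        start = compressed.find(needle, start + 1)
    return hits
```

For example, after `codebook = build_codebook(words)` and `data = compress(words, codebook)`, calling `find_occurrences(data, ["the", "cat"], codebook)` searches the compressed bytes for the compressed pattern. The tag bit is what lets a plain byte search reject matches that fall inside a longer codeword.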

Journal: Perspectives in Science	Publication Date: Jul 4, 2016
Citations: 3	License type: cc-by-nc-nd

Similar Papers

Analyzing the performance differences between pattern matching and compressed pattern matching on texts
Cihat Erdogan ... Banu Diri
01 Nov 2013

An approach for fast compressed text matching and to avoid false matching using WBTC and wavelet tree
Shashank Srivastav ... P Singh
ICST Transactions on Scalable Information Systems | VOL. 8
13 Jul 2018

Fast Pattern Matching in Compressed Text using Wavelet Tree
Surya Prakash Mishra ... Gurmit Singh
IETE Journal of Research | VOL. 64
25 Jul 2017

Lecture on Progress toward Petascale Applications in Bioinformatics and Computational Biology
C.A Stewart ... D Bader
01 Oct 2007
