Abstract

Online pattern matching has become an increasingly important method for utilizing the discovered frequent patterns to build various intelligent systems, and obviously it demands high pattern matching efficiency. Memory-based online pattern matching is a solution, but it requires a frequent pattern base with small storage size. The closed itemsets are a lossless and condensed representation of all frequent itemsets, however, their itemset format does not facilitate implementing efficient pattern matching. This paper proposes a novel method that builds the bitmap inverted file from the closed itemsets and uses it to substitute for the closed itemsets to perform pattern matching. The bitmap inverted files use bitmaps instead of the referential lists to keep the maps of frequent items in the closed itemsets, so as to reduce the storage size and promote the intersecting operation efficiency. In order to completely substitute for the closed itemsets in pattern matching, the bitmap inverted files also keep other information relevant to the closed itemsets, such as their supports and lengths. Experiments show that on dense datasets, the storage size of the bitmap inverted files is much smaller than that of the set of closed itemsets, and the pattern matching based on the bitmap inverted files resident in memory is orders of magnitude more efficient than that based on the closed itemsets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.