Abstract

Recent studies have shown mining compressed frequent itemset patterns provides more benefits than mining the closed frequent patterns, since mining compressed frequent itemset patterns leads to more compact and representative result sets. Especially, it is quite meaningful in the environment of data stream where limited memory space and computation quality are major challenges. In this paper, the problem of mining compressed frequent itemset patterns over a data stream sliding windows is presented and studied. Firstly, a novel data structure CP-Tree (Compressed Pattern Tree) is designed to maintain a dynamically selected set of compressed frequent itemset patterns over sliding window. Secondly, an efficient algorithm CFPStream (Compressing Frequent Patterns over Stream) is developed to discover compressed frequent itemset patterns in data stream sliding windows incrementally. Finally, some optimization techniques are adopted in CFPStream to speed up the algorithm and prune search space. Experiments on both real and synthetic data sets show that CFPStream outperforms representative algorithms for the state-of-the-art approaches.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call