Mining top-k high utility patterns over data streams

Morteza Zihayat,Aijun An

doi:10.1016/j.ins.2014.01.045

Abstract

Online high utility itemset mining over data streams has been studied recently. However, the existing methods are not designed for producing top-k patterns. Since there could be a large number of high utility patterns, finding only top-k patterns is more attractive than producing all the patterns whose utility is above a threshold. A challenge with finding top-k high utility itemsets over data streams is that it is not easy for users to determine a proper minimum utility threshold in order for the method to work efficiently. In this paper, we propose a new method (named T-HUDS) for finding top-k high utility patterns over sliding windows of a data stream. The method is based on a compressed tree structure, called HUDS-tree, that can be used to efficiently find potential top-k high utility itemsets over sliding windows. T-HUDS uses a new utility estimation model to more effectively prune the search space. We also propose several strategies for initializing and dynamically adjusting the minimum utility threshold. We prove that no top-k high utility itemset is missed by the proposed method. Our experimental results on real and synthetic datasets show that our strategies and new utility estimation model work very effectively and that T-HUDS outperforms two state-of-the-art high utility itemset algorithms substantially in terms of execution time and memory storage.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Mining top-k high utility patterns over data streams

Abstract

Talk to us

Similar Papers

More From: Information Sciences

Lead the way for us

Journal: Information Sciences	Publication Date: Feb 3, 2014
Citations: 81

Similar Papers

TKU-CE: Cross-Entropy Method for Mining Top-K High Utility Itemsets
Wei Song ... Chaomin Huang
-
Wei Song, et. al.Wei Song ... Chaomin Huang
01 Jan 2020
01 Jan 2020

TKEH: an efficient algorithm for mining top-k high utility itemsets
Kuldeep Singh ... Bhaskar Biswas
Applied Intelligence | VOL. 49
Kuldeep Singh, et. al.Kuldeep Singh ... Bhaskar Biswas
25 Oct 2018
Applied Intelligence | VOL. 49

Mining high average utility itemsets using artificial fish swarm algorithm with computed multiple minimum average utility thresholds
S.S Nandhini ... S Kannimuthu
Journal of Intelligent & Fuzzy Systems | VOL. 46
S.S Nandhini, et. al.S.S Nandhini ... S Kannimuthu
10 Jan 2024
Journal of Intelligent & Fuzzy Systems | VOL. 46

Mining Top-k High On-shelf Utility Itemsets Using Novel Threshold Raising Strategies
Kuldeep Singh ... Bhaskar Biswas
ACM Transactions on Knowledge Discovery from Data | VOL. 18
Kuldeep Singh, et. al.Kuldeep Singh ... Bhaskar Biswas
26 Mar 2024
ACM Transactions on Knowledge Discovery from Data | VOL. 18

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Mining top-k high utility patterns over data streams

Abstract

Talk to us

Similar Papers

More From: Information Sciences