Mining High Utility Patterns in One Phase without Generating Candidates

Junqiang Liu,Ke Wang,Benjamin C.M Fung

doi:10.1109/tkde.2015.2510012

Abstract

Utility mining is a new development of data mining technology. Among utility mining problems, utility mining with the itemset share framework is a hard one as no anti-monotonicity property holds with the interestingness measure. Prior works on this problem all employ a two-phase, candidate generation approach with one exception that is however inefficient and not scalable with large databases. The two-phase approach suffers from scalability issue due to the huge number of candidates. This paper proposes a novel algorithm that finds high utility patterns in a single phase without generating candidates. The novelties lie in a high utility pattern growth approach, a lookahead strategy, and a linear data structure. Concretely, our pattern growth approach is to search a reverse set enumeration tree and to prune search space by utility upper bounding. We also look ahead to identify high utility patterns without enumeration by a closure property and a singleton property. Our linear data structure enables us to compute a tight bound for powerful pruning and to directly identify high utility patterns in an efficient and scalable way, which targets the root cause with prior algorithms. Extensive experiments on sparse and dense, synthetic and real world data suggest that our algorithm is up to 1 to 3 orders of magnitude more efficient and is more scalable than the state-of-the-art algorithms.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Mining High Utility Patterns in One Phase without Generating Candidates

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Knowledge and Data Engineering

Lead the way for us

Journal: IEEE Transactions on Knowledge and Data Engineering	Publication Date: May 1, 2016
Citations: 155

Similar Papers

Mining High Utility Patterns in One Phase without Generating Candidates
Rashmi Rashmi Reddy M
International Journal Of Engineering And Computer Science | VOL. 6
Rashmi Rashmi Reddy MRashmi Rashmi Reddy M
09 Jun 2017
International Journal Of Engineering And Computer Science | VOL. 6

Interactive mining of high utility patterns over data streams
Chowdhury Farhan Ahmed ... Ho-Jin Choi
Expert Systems With Applications | VOL. 39
Chowdhury Farhan Ahmed, et. al.Chowdhury Farhan Ahmed ... Ho-Jin Choi
01 Apr 2012
Expert Systems With Applications | VOL. 39

One scan based high average-utility pattern mining in static and dynamic databases
Jongseong Kim ... Philippe Fournier-Viger
Future Generation Computer Systems | VOL. 111
Jongseong Kim, et. al.Jongseong Kim ... Philippe Fournier-Viger
30 Apr 2020
Future Generation Computer Systems | VOL. 111

Mining recent high average utility patterns based on sliding window from stream data
Unil Yun ... Kyung-Min Lee
Journal of Intelligent & Fuzzy Systems | VOL. 30
Unil Yun, et. al.Unil Yun ... Kyung-Min Lee
30 Apr 2016
Journal of Intelligent & Fuzzy Systems | VOL. 30

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Mining High Utility Patterns in One Phase without Generating Candidates

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Knowledge and Data Engineering