A fast high average-utility itemset mining with efficient tighter upper bounds and novel list structure

Krishan Kumar Sethi,Dharavath Ramesh

doi:10.1007/s11227-020-03247-5

Abstract

High-utility itemset mining is a prominent data-mining technique where the profit or weight of itemsets plays a crucial role in defining meaningful patterns. High average-utility itemset (HAUI) mining is an advancement over high-utility itemset mining, which introduces an unbiased measure called average utility to associate the utility of itemsets with their length. Several existing HAUI mining algorithms use various upper bounds such as average-utility upper bound, revised tighter upper bound, and looser upper bound to preserve pruning methods. However, these upper bounds overestimate the average-utility of itemsets and slow down the mining process. This paper presents a fast high average-utility itemset miner (FHAIM) algorithm, which uses two improved upper bounds and several efficient pruning strategies to avoid the processing of unpromising candidate itemsets. Moreover, a novel list structure named recommended average-utility list (RAUL) is presented to store the average-utility and the required information for pruning. The RAUL for an itemset can be constructed by joining the RAULs of its subsets to avoid excessive database scans. We have performed substantial experiments on various benchmark datasets to evaluate the performance of the FHAIM in comparison with two existing HAUI mining algorithms. Experimental results show that FHAIM outperforms the existing HAUI mining algorithms in terms of runtime, memory usage, join counts, and scalability.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A fast high average-utility itemset mining with efficient tighter upper bounds and novel list structure

Abstract

Talk to us

Similar Papers

More From: The Journal of Supercomputing

Lead the way for us

Journal: The Journal of Supercomputing	Publication Date: Mar 18, 2020
Citations: 16

Similar Papers

Mining high average utility itemsets using artificial fish swarm algorithm with computed multiple minimum average utility thresholds
S.S Nandhini ... S Kannimuthu
Journal of Intelligent & Fuzzy Systems | VOL. 46
S.S Nandhini, et. al.S.S Nandhini ... S Kannimuthu
10 Jan 2024
Journal of Intelligent & Fuzzy Systems | VOL. 46

TUB-HAUPM: Tighter Upper Bound for Mining High Average-Utility Patterns
Jimmy Ming-Tai Wu ... Matin Pirouz
IEEE Access | VOL. 6
Jimmy Ming-Tai Wu, et. al.Jimmy Ming-Tai Wu ... Matin Pirouz
01 Jan 2018
IEEE Access | VOL. 6

H-Map-Based Technique for Mining High Average Utility Itemset
M S Bhuvaneswari ... K Muneeswaran
IETE Journal of Research | VOL. ahead-of-print
M S Bhuvaneswari, et. al.M S Bhuvaneswari ... K Muneeswaran
27 May 2022
IETE Journal of Research | VOL. ahead-of-print

High Average Utility Itemset Mining: A Survey
Mathe John Kenny Kumar ... Dipti Rana
-
Mathe John Kenny Kumar, et. al.Mathe John Kenny Kumar ... Dipti Rana
21 Dec 2020
21 Dec 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A fast high average-utility itemset mining with efficient tighter upper bounds and novel list structure

Abstract

Talk to us

Similar Papers

More From: The Journal of Supercomputing