An Efficient Tree-Based Algorithm for Mining High Average-Utility Itemset

Irfan Yildirim,Mete Celik

doi:10.1109/access.2019.2945840

Irfan Yildirim, Mete Celik

Open Access

https://doi.org/10.1109/access.2019.2945840

Copy DOI

Journal: IEEE Access	Publication Date: Jan 1, 2019
Citations: 19	License type: CC BY 4.0

Affiliation: Erzurum Technical University, Erciyes University

Abstract

High-utility itemset mining (HUIM), which is an extension of well-known frequent itemset mining (FIM), has become a key topic in recent years. HUIM aims to find a complete set of itemsets having high utilities in a given dataset. High average-utility itemset mining (HAUIM) is a variation of traditional HUIM. HAUIM provides an alternative measurement named the average-utility to discover the itemsets by taking into consideration both of the utility values and lengths of itemsets. HAUIM is important for several application domains, such as, business applications, medical data analysis, mobile commerce, streaming data analysis, etc. In the literature, several algorithms have been proposed by introducing their own upper-bound models and data structures to discover high average utility itemsets (HAUIs) in a given database. However, they require long execution times and large memory consumption to handle the problem. To overcome these limitations, this paper, first, introduces four novel upper-bounds along with pruning strategies and two data structures. Then, it proposes a pattern growth approach called the HAUL-Growth algorithm for efficiently mining of HAUIs using the proposed upper-bounds and data structures. Experimental results show that the proposed HAUL-Growth algorithm significantly outperforms the state-of-the-art dHAUIM and TUB-HAUIM algorithms in terms of execution times, number of join operations, memory consumption, and scalability.

Highlights

Frequent itemset mining (FIM), which is one of the most well-known techniques to discover relations among items in large data, was originally introduced to discover frequently purchased itemsets by customers [1]–[4]
A typical high average-utility itemset mining (HAUIM) approach aims to find a complete set of high average utility itemsets (HAUIs) based on a given minimum utility threshold (minUtil) threshold
This study proposes an algorithm named as High Average-Utility List-Growth (HAUL-Growth) algorithm for mining HAUIs efficiently

Summary

INTRODUCTION

Frequent itemset mining (FIM), which is one of the most well-known techniques to discover relations among items in large data, was originally introduced to discover frequently purchased itemsets by customers [1]–[4]. The problem of high-utility itemset mining (HUIM) [5], [6] was introduced as an extension of FIM to discover more meaningful itemsets by taking into account non-binary attributes of items. Celik: Efficient Tree-Based Algorithm for Mining High Average-Utility Itemset most of the discovered HUIs may contain items with low utilities. To address these limitations, the problem of high average-utility mining (HAUIM) is introduced with a more fair measurement named average-utility [7]. A typical HAUIM approach aims to find a complete set of HAUIs based on a given minUtil threshold This process is computationally complex due to anti-monotonic characteristic of average-utilities of itemsets.

RELATED WORK

PROPOSED DATA STRUCTURES

VIII. CONCLUSION AND FUTURE WORKS

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An Efficient Tree-Based Algorithm for Mining High Average-Utility Itemset

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Mining high average utility itemsets using artificial fish swarm algorithm with computed multiple minimum average utility thresholds
S.S Nandhini ... S Kannimuthu
Journal of Intelligent & Fuzzy Systems | VOL. 46
S.S Nandhini, et. al.S.S Nandhini ... S Kannimuthu
10 Jan 2024
Journal of Intelligent & Fuzzy Systems | VOL. 46

H-Map-Based Technique for Mining High Average Utility Itemset
M S Bhuvaneswari ... K Muneeswaran
IETE Journal of Research | VOL. ahead-of-print
M S Bhuvaneswari, et. al.M S Bhuvaneswari ... K Muneeswaran
27 May 2022
IETE Journal of Research | VOL. ahead-of-print

High Average Utility Itemset Mining: A Survey
Mathe John Kenny Kumar ... Dipti Rana
-
Mathe John Kenny Kumar, et. al.Mathe John Kenny Kumar ... Dipti Rana
21 Dec 2020
21 Dec 2020

TUB-HAUPM: Tighter Upper Bound for Mining High Average-Utility Patterns
Jimmy Ming-Tai Wu ... Matin Pirouz
IEEE Access | VOL. 6
Jimmy Ming-Tai Wu, et. al.Jimmy Ming-Tai Wu ... Matin Pirouz
01 Jan 2018
IEEE Access | VOL. 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Efficient Tree-Based Algorithm for Mining High Average-Utility Itemset

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access