High Utility Itemset Mining Algorithms Research Articles

In recent years, mining high-utility itemsets (HUIs) has emerged as a key topic in data mining. It consists of discovering sets of items generating a high profit in a transactional database by considering both purchase quantities and unit profits of items. Many algorithms have been proposed for this task. However, most of them assume the unrealistic assumption that unit profits of items remain unchanged over time. But in real-life, the profit of an item or itemset varies as a function of cost prices, sales prices and sale strategies. Recently, a three-phase algorithm has been proposed to mine HUIs, while considering that each item may have different discount strategies. However, the complete set of HUIs cannot be retrieved based on the traditional TWU model with its defined discount strategies. Moreover, it suffers from the well-known drawbacks of Apriori-based algorithms such as maintaining a huge amount of candidates in memory and repeatedly performing time-consuming database scans. In this paper, a HUI-DTP algorithm for mining HUIs when considering discount strategies of items is introduced. The HUI-DTP is designed as a two-phase algorithm to mine the complete set of HUIs based on a novel downward closure property and a vertical TID-list structure. Furthermore, the HUI-DMiner is an algorithm relying on a compact data structure (Positive-and-Negative Utility-list, PNU-list) and properties of two new pruning strategies to efficiently discover HUIs without candidate generation, while considerably reducing the size of the search space. Moreover, a strategy named Estimated Utility Co-occurrence Strategy which stores the relationships between 2-itemsets is also applied in the improved HUI-DEMiner algorithm to speed up computation. An extensive experimental study carried on several real-life datasets shows that the proposed algorithms outperform the previous best algorithm in terms of runtime, memory consumption and scalability.

High-utility itemset mining (HUIM) is a useful set of techniques for discovering patterns in transaction databases, which considers both quantity and profit of items. However, most algorithms for mining high-utility itemsets (HUIs) assume that the information stored in databases is precise, i.e., that there is no uncertainty. But in many real-life applications, an item or itemset is not only present or absent in transactions but is also associated with an existence probability. This is especially the case for data collected experimentally or using noisy sensors. In the past, many algorithms were respectively proposed to effectively mine frequent itemsets in uncertain databases. But mining HUIs in an uncertain database has not yet been proposed, although uncertainty is commonly seen in real-world applications. In this paper, a novel framework, named potential high-utility itemset mining (PHUIM) in uncertain databases, is proposed to efficiently discover not only the itemsets with high utilities but also the itemsets with high existence probabilities in an uncertain database based on the tuple uncertainty model. The PHUI-UP algorithm (potential high-utility itemsets upper-bound-based mining algorithm) is first presented to mine potential high-utility itemsets (PHUIs) using a level-wise search. Since PHUI-UP adopts a generate-and-test approach to mine PHUIs, it suffers from the problem of repeatedly scanning the database. To address this issue, a second algorithm named PHUI-List (potential high-utility itemsets PU-list-based mining algorithm) is also proposed. This latter directly mines PHUIs without generating candidates, thanks to a novel probability-utility-list (PU-list) structure, thus greatly improving the scalability of PHUI mining. Substantial experiments were conducted on both real-life and synthetic datasets to assess the performance of the two designed algorithms in terms of runtime, number of patterns, memory consumption, and scalability.

High Utility Itemset Mining Algorithms Research Articles

Related Topics

Articles published on High Utility Itemset Mining Algorithms

An empirical evaluation of high utility itemset mining algorithms

A survey of incremental high‐utility itemset mining

P-FHM+: Parallel high utility itemset mining algorithm for big data processing

Mining High Utility Itemsets Using Bio-Inspired Algorithms: A Diverse Optimal Value Framework

Review on high utility itemset mining algorithms for big data

A two-phase approach to mine short-period high-utility itemsets in transactional databases

FHN: An efficient algorithm for mining high-utility itemsets with negative unit profits

Approximate Parallel High Utility Itemset Mining

Binary partition for itemsets expansion in mining high utility itemsets

Efficiently mining uncertain high-utility itemsets

An efficient algorithm to mine high average-utility itemsets

Fast algorithms for mining high-utility itemsets with various discount strategies

Efficient algorithms for mining high-utility itemsets in uncertain databases

Review on High Utility Itemset Mining Algorithms

Recommender System for Academic Literature with Incremental Dataset

A high utility itemset mining algorithm based on subsume index

Maintaining the discovered high-utility itemsets with transaction modification

Efficient Algorithms for Mining the Concise and Lossless Representation of High Utility Itemsets

Semantic trajectory-based high utility item recommendation system

An Algorithm of Top-k High Utility Itemsets Mining over Data Stream

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

High Utility Itemset Mining Algorithms Research Articles

Related Topics

Articles published on High Utility Itemset Mining Algorithms

An empirical evaluation of high utility itemset mining algorithms

A survey of incremental high‐utility itemset mining

P-FHM+: Parallel high utility itemset mining algorithm for big data processing

Mining High Utility Itemsets Using Bio-Inspired Algorithms: A Diverse Optimal Value Framework

Review on high utility itemset mining algorithms for big data

A two-phase approach to mine short-period high-utility itemsets in transactional databases

FHN: An efficient algorithm for mining high-utility itemsets with negative unit profits

Approximate Parallel High Utility Itemset Mining

Binary partition for itemsets expansion in mining high utility itemsets

Efficiently mining uncertain high-utility itemsets

An efficient algorithm to mine high average-utility itemsets

Fast algorithms for mining high-utility itemsets with various discount strategies

Efficient algorithms for mining high-utility itemsets in uncertain databases

Review on High Utility Itemset Mining Algorithms

Recommender System for Academic Literature with Incremental Dataset

A high utility itemset mining algorithm based on subsume index

Maintaining the discovered high-utility itemsets with transaction modification

Efficient Algorithms for Mining the Concise and Lossless Representation of High Utility Itemsets

Semantic trajectory-based high utility item recommendation system

An Algorithm of Top-k High Utility Itemsets Mining over Data Stream