High utility pattern mining (HUPM) extends frequent pattern mining (FPM) by including item significance and quantity, which determine their utility, in transaction databases. A pattern is a high utility if its utility satisfies the minimum threshold. Several efficient approaches have been developed to address this task. However, more extensive databases combined with lower thresholds can lead to an explosion in the number of discovered patterns. Use the closed version of the pattern to decrease patterns while keeping information. To improve real-world applicability, HUPM research now considers item connections. Categorizing items creates an abstraction model that can help decision-makers. This new mining task is generalized high utility pattern mining (GHUPM). However, GHUPM significantly increases the complexity of the mining task, resulting in longer mining times, higher memory usage, and a more significant number of discovered patterns. To date, none of the proposed approaches in GHUPM have utilized the closed representation to shrink the size of the result set. This study introduces MLC-Miner, a multi-level closed high utility pattern extraction technique, to fill this research gap. MLC-Miner combines several efficient methods to enhance mining performance and reduce memory footprints. Experimental results show that MLC-Miner can handle up to 5 million transactions efficiently.
Read full abstract