• An efficient Approach is proposed for Mining Maximized Erasable Patterns with External Utility. • Novel list structures are devised to manage minimum non-banary data. • Techniques and algorithms are suggested to perform efficiently construction and mining. • Performance improvements are shown with various tests in terms of runtime, memory usage, and scalability. As a method of extracting important information in the real world, the field of pattern mining has been actively studied. In particular, in industrial fields such as factories, it is also important to find patterns with low value and remove them in order to overcome the economic crisis. Erasable pattern mining has emerged to solve the concern. Traditional erasable pattern mining targeted a binary database containing only information on the existence of items. However, in the real world, data has a variety of properties, such as quantity and profit. Considering these properties makes it possible for more meaningful result patterns to be mined. Therefore, we propose a maximized erasable pattern mining algorithm that uses a list structure as a data structure and takes into account the quantity and price of the item. The proposed method considers the amount and relative benefit of each item to estimate the profit of a transaction and extracts the maximized erasable patterns considering the user’s preference. Moreover, the list structure utilized in the proposed method contains only the essential information for the mining process, which not only allows the mining to be performed efficiently, but also reduces memory usage. Using various real-world and synthetic datasets, we evaluate runtime, memory efficiency, and scalability of the suggested technique and state-of-the-art competitors. The evaluation results demonstrate that the proposed method performs better.
Read full abstract