Abstract

Mining association rules with constraints allow us to concentrate on discovering a useful subset instead of the complete set of association rules. With the aim of satisfying the needs of users and improving the efficiency and effectiveness of mining task, many various constraints and mining algorithms have been proposed. In practice, finding rules regarding specific itemsets is of interest. Thus, this paper considers the problem of mining association rules whose left-hand and right-hand sides contain two given itemsets, respectively. In addition, they also have to satisfy two given maximum support and confidence constraints. Applying previous algorithms to solve this problem may encounter disadvantages, such as the generation of many redundant candidates, time-consuming constraint check and the repeated reading of the database when the constraints are changed. The paper proposes an equivalence relation using the closure of itemset to partition the solution set into disjoint equivalence classes and a new, efficient representation of the rules in each class based on the lattice of closed itemsets and their generators. The paper also develops a new algorithm, called MAR-MINSC, to rapidly mine all constrained rules from the lattice instead of mining them directly from the database. Theoretical results are proven to be reliable. Because MAR-MINSC does not meet drawbacks above, in extensive experiments on many databases it obtains the outstanding performance in comparison with some of existing algorithms in mining association rules with the constraints mentioned.

Highlights

  • For the aim of reducing the burden of storage and execution time and rapidly responding to the demand of users, constraint-based data mining has attracted much interest and attention from researchers

  • Prior to presenting an appropriate approach to discover the rules with minimum single constraints without (P2), let us recall some of the following basic concepts about the lattice of closed itemsets and the task of association rule mining

  • To briefly present the results regarding the representation of the rule sides, we first consider a fairly general representation of frequent sub-items of Y that are restricted on X with minimum single constraint

Read more

Summary

Introduction

For the aim of reducing the burden of storage and execution time and rapidly responding to the demand of users, constraint-based data mining has attracted much interest and attention from researchers. Association rules are mined, where the minimum confidence constraint is other primitive one. Vietnam J Comput Sci (2015) 2:67–83 if there only have the constraints about support and confidence To solve this problem (P1), many more complicated constraints have been introduced into algorithms to only generate association rules related directly to the user’s true needs, and to reduce the cost of the mining. Monotonic and antimonotonic constraints, denoted as Cm and Cam respectively, are considered by Nguyen et al [25] They are pushed into an Apriori-like algorithm, named CAP, to reduce the frequent itemsets computation. With a suitable approach, the papers propose efficient algorithms, named MFS-Contain-IC and MFS_DoubleCons, for discovering frequent itemsets with the constraints mentioned. Let us state our problem as in sub-section below

Problem statement
Paper contribution
Preliminary concepts and notations
Approaches
Related works
Rough partition
Smooth partition of association rule set with minimum single constraints
The explicitly unique representation and the structure of an extended class
Experimental results
Conclusions and future works
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call