Abstract

In recent years, frequent pattern mining from uncertain data has been actively researched in data mining. There are numerous exact and upper bound-based approaches for uncertain frequent pattern mining. Exact-based algorithms may produce a large data structure and need time-consuming calculations and upper bound-based algorithms may produce many false positives. As a result, these algorithms demand much time and memory. There have been efforts to resolve the problem of upper bound-based algorithms, however, all of these methods only try to tighten the upper bound of expected support for long patterns. This is while pruning infrequent short patterns has a greater impact on reducing the false positives. To overcome these drawbacks, in this paper an efficient method based on upper bound is proposed for mining uncertain frequent patterns. The proposed method uses a new Tightened upper bound to expected support of patterns (Tup) which has a significant effect on reducing the number of false positives by tightening the upper bound of expected support and early pruning of infrequent 2-itemsets and their supersets. Comprehensive experimental results show that the proposed method reduces memory consumption in most cases and dramatically improves the performance of exact and upper bound-based methods in terms of runtime and scalability for dense and sparse uncertain data.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call