Abstract

Frequent itemsets discovery is popular in database communities recently. Because real data is often affected by noise, in this paper, we study to find frequent itemsets over probabilistic database under the Possible World Semantics. It is challenging because there may be exponential number of possible worlds for probabilistic database. Although several efficient algorithms are proposed in the literature, it is hard to mine frequent itemsets in large uncertain database due to the high time consuming. To address this issue, we propose an efficient algorithm to mine probabilistic frequent itemsets. A pruning strategy is also presented to accelerate the process of generating candidates. Extensive experiments have been done on synthetic and real databases, demonstrating that the proposed method preforms better than state-of-art methods in most cases.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.