Abstract

High Utility Itemset Mining (HUIM) aids in the discovery of itemsets based on quantity and unit price of the items from a transactional database. Since its inception, HUIM has evolved as a generalized form of Frequent Itemset Mining (FIM). Unlike the support of an itemset which is antimonotone and is exploited in the algorithms for mining frequent itemsets, the utility measure is neither antimonotone nor monotone. This makes the problem of mining High Utility Itemsets (HUIs) interesting. In the current study, a novel method based on Dynamic Itemset Counting (DIC) has been proposed to optimize the Apriori-like Two-Phase (TP) algorithm for mining HUIs. Although, the TP algorithm uses antimonotonicity of Transaction Weighted Utility (TWU) of itemsets to prune the search space, the candidates are generated in a level-wise manner. This requires multiple database scans to test the candidates. The proposed method tests and generates higher order candidates at different stops during the database scan and segregates the itemsets for further evaluation. Experiments performed on real-time datasets show significant improvement in execution time of the DIC method when compared to the TP algorithm.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.