Application and Analysis of Hypergraph Association Rule Redundancy Algorithm in Data Mining

Huanli Pang,Lianzhe Zhou,Xuefeng Zhang

doi:10.1155/2022/1193586

Huanli Pang, Lianzhe Zhou + Show 1 more

Open Access

https://doi.org/10.1155/2022/1193586

Copy DOI

Journal: Mobile Information Systems	Publication Date: Oct 11, 2022
Citations: 1	License type: CC BY 4.0

Affiliation: Changchun University of Technology

Abstract

In order to realize “from individual data research to data system research” and “from passive data verification to active discovery,” this study proposes a hypergraph-based association rule redundancy processing algorithm in data mining. This study introduces the concepts of hypergraph and system, explores the establishment of hypergraph on a three-dimensional matrix model, and adopts a new hyperedge definition method according to the characteristics of big data and the concept of the system, which improves the ability to deal with problems; the association rules are transformed into a directed hypergraph, and the adjacency matrix is redefined. The detection of redundancy and loops is transformed into the processing of connected blocks and cycles in the hypergraph. The experimental results show that two UCI datasets were selected, namely, the balloons dataset and the shuttle landing control dataset, in which the minimum support and minimum confidence of the balloons dataset are both 5%. The dataset has 4 attributes, and 18 association rules are obtained through the Aprior algorithm. Although the running time of the coevolution algorithm is slightly longer than that of the other two global optimization algorithms, the running time is completely within the acceptable range. Moreover, due to the effective introduction of the idea of coevolution, compared with the use of the other two algorithms for association rule mining, it not only has a better mining quality but also has a significant advantage in the ability to jump out of the local optimal solution, realizing the search of high-quality association rules in high-dimensional datasets. Conclusion. This model provides a new idea and method for the redundant processing of association rules.

Full Text