Abstract

Data mining extracts useful knowledge from big data. The extracted knowledge in data mining is often represented by association rules, and association rules can be also used for classification. However, when association rules for classification (called class association rules) are extracted, a large number of data with class labels are necessary, which requires a lot of cost of manual annotation. Therefore, this paper proposes a semisupervised learning method for rule extraction, where a small number of labeled data and a large number of unlabeled data are used to efficiently extract class association rules. In detail, this paper proposes two types of classifiers using class association rules: one is a classifier based on semisupervised learning, and the other is that based on both supervised and semisupervised learning. The second method builds several classifiers using supervised learning and semisupervised learning and the classification results of these classifiers are integrated to make the final decision. As an association rule mining method, Genetic Network Programming (GNP), which is one of the graph‐based evolutionary algorithms is used. GNP has shown distinguished classification ability in some applications; therefore, one of the other objectives of this paper is to extend the applicability of GNP to data mining based on semisupervised learning. In the experiments, the classification accuracy is evaluated using some benchmark datasets by comparing with some conventional methods. © 2020 Institute of Electrical Engineers of Japan. Published by John Wiley & Sons, Inc.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call