Association rule mining for continuous attributes using genetic network programming

Karla Taboada,Jinglu Hu,Kotaro Hirasawa,Kaoru Shimada,Shingo Mabu

doi:10.1145/1276958.1277308

Abstract

Most association rule mining algorithms make use of discretization algorithms for handling continuous attributes. However, by means of methods of discretization, it is difficult to get highest attribute interdependency and at the same time to get lowest number of intervals. We propose a method using a new graph-based evolutionary algorithm named Network Programming (GNP) that can deal with continues values directly, that is, without using any discretization method as a preprocessing step. GNP is one of the evolutionary optimization techniques, which uses directed graph structures as solutions and is composed of three kinds of nodes: start node, judgment node and processing node. Once GNP is booted up, firstly the execution starts from the start node, secondly the next node to be executed is determined according to the judgment and connection from the current activated node. The features of GNP are described as follows. First, it is possible to reuse nodes; because of this, the structure is compact. Second, GNP can find solutions of problems without bloat, which can be sometimes found in Genetic Programming (GP), because of the fixed number of nodes in GNP. Third, nodes that are not used at the current program executions will be used for future evolution. Fourth, GNP is able to cope with partially observable Markov processes. In this paper, we propose a method that can deal with continuous attributes, where attributes in databases correspond to judgment nodes in GNP and each continuous attribute is checked whether its value is greater than a threshold value and the association rules are represented as the connections of the judgment nodes. Threshold ai is firstly determined by calculating the mean µi and standard deviation si of all attribute values of Ai. Then, initial threshold ai is selected randomly between the interval [µi - aisi, µi + aisi] where ai is a parameter to determine the range of the interval. Once the threshold ai is selected for all attributes, each value of the attribute Ai is checked if it is greater than the threshold ai in the judgment nodes of the proposed method. In addition to that, the threshold ai is also evolved by mutation between [µi - aisi, µi + aisi] in every generation in order to obtain as many association rules as possible. The features of the proposed method are as follows compared with other methods: 1) Extracts rules without identifying frequent itemsets used in Apriori-like mining methods. 2) Stores extracted important association rules in a pool all together through generations. 3) Measures the significance of associations via the chi-squared test. 4) Extracts important rules sufficient enough for user's purpose in a short time. 5) The pool is updated in every generation and only important association rules with higher chi-squared value are stored when the identical rules are stored. We have evaluated the proposed method by doing two simulations. Simulation 1 uses fixed threshold values; that is, they remain fixed at initial thresholds during evolution. In simulation 2,thresholds are evolved by mutation in every generation. Fig. 1 shows the number of rules extracted in the pool in simulation 2. It is found that the number of rules extracted has been increased, which means simulation 2 outperforms simulation 1.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Association rule mining for continuous attributes using genetic network programming

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Medical Association Rule Mining Using Genetic Network Programming
Kaoru Shimada ... Takayuki Furuzuki
IEEJ Transactions on Electronics, Information and Systems | VOL. 126
Kaoru Shimada, et. al.Kaoru Shimada ... Takayuki Furuzuki
01 Jan 2006
IEEJ Transactions on Electronics, Information and Systems | VOL. 126

Medical association rule mining using genetic network programming
Kaoru Shimada ... Takayuki Furuzuki
Electronics and Communications in Japan | VOL. 91
Kaoru Shimada, et. al.Kaoru Shimada ... Takayuki Furuzuki
01 Feb 2008
Electronics and Communications in Japan | VOL. 91

Mining association rules from databases with continuous attributes using genetic network programming
Karla Taboada ... Kotaro Hirasawa
-
Karla Taboada, et. al.Karla Taboada ... Kotaro Hirasawa
01 Sep 2007
01 Sep 2007

Generalized Time Related Sequential Association rule mining and traffic prediction
Huiyu Zhou ... Kotaro Hirasawa
-
Huiyu Zhou, et. al.Huiyu Zhou ... Kotaro Hirasawa
01 May 2009
01 May 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Association rule mining for continuous attributes using genetic network programming

Abstract

Talk to us

Similar Papers