Abstract

Fuzzy association rules can reveal useful dependencies and interactions hidden in large gene expression data sets. However their derivation perplexes very difficult combinatorial problems that depend heavily on the size of these sets. The paper follows a divide and conquer approach to the problem that obtains computationally manageable solutions. Initially we cluster genes that more probably are associated. Thereafter, the fuzzy association rule extraction algorithms confront many but significantly reduced computationally problems that usually can be processed very fast. The clustering phase is accomplished by means of an approach based on mutual information (MI). This approach uses the mutual information as a similarity measure. However, the numerical evaluation of the MI is subtle. We experiment with the main methods and we compare between them. As the device that implements the mutual information clustering we use a SOM (Self-Organized Map) based approach that is capable of effectively incorporating supervised bias. After the mutual information clustering phase the fuzzy association rules are extracted locally on a per cluster basis. The paper presents an application of the techniques for mining the gene expression data. However, the presented techniques can easily be adapted and can be fruitful for intelligent exploration of any other similar data set as well.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call