Abstract

In this paper the modified version of cGAAM (a genetic algorithm for feature selection for clustering) is introduced. As it can be shown, the algorithm is able to find significant subsets of features in data sets that differ in size and number of classes. The common feature of the sets that were used to test the cGAAM is that the examples are provided with class labels. Due to this, although the clustering process was performed without the class labels, the chosen feature sets could be compared with feature subsets returned by Lasso method in terms of classification accuracy. The most important observation from the results presented in the paper is that the classification accuracy obtained with feature subsets returned by cGAAM was not only comparable with accuracy obtained with feature subsets returned by Lasso but almost always was higher than 80% (ionsphere dataset) and 90% (humanactivity dataset).

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.