Optimization of the Numeric and Categorical Attribute Weights in KAMILA Mixed Data Clustering Algorithm

Nádia Junqueira Martarelli,Marcelo Seido Nagano

doi:10.1007/978-3-030-33607-3_3

Abstract

The mixed data clustering algorithms have been timidly emerging since the end of the last century. One of the last algorithms proposed for this data-type has been KAMILA (KAy-means for MIxed LArge data) algorithm. While the KAMILA has outperformed the previous mixed data algorithms results, it has some gaps. Among them is the definition of numerical and categorical variable weights, which is a user-defined parameter or, by default, equal to one for all features. Hence, we propose an optimization algorithm called Biased Random-Key Genetic Algorithm for Features Weighting (BRKGAFW) to accomplish the weighting of the numerical and categorical variables in the KAMILA algorithm. The experiment relied on six real-world mixed data sets and two baseline algorithms to perform the comparison, which are the KAMILA with default weight definition, and the KAMILA with weight definition done by the traditional genetic algorithm. The results have revealed the proposed algorithm overperformed the baseline algorithms results in all data sets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Optimization of the Numeric and Categorical Attribute Weights in KAMILA Mixed Data Clustering Algorithm

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Visual exploration of categorical and mixed data sets
Sara Johansson
-
Sara JohanssonSara Johansson
28 Jun 2009
28 Jun 2009

Determining the number of clusters using information entropy for mixed data
Jiye Liang ... Fuyuan Cao
Pattern Recognition | VOL. 45
Jiye Liang, et. al.Jiye Liang ... Fuyuan Cao
24 Dec 2011
Pattern Recognition | VOL. 45

Visual analysis of mixed data sets using interactive quantification
Sara Johansson ... Jimmy Johansson
ACM SIGKDD Explorations Newsletter | VOL. 11
Sara Johansson, et. al.Sara Johansson ... Jimmy Johansson
27 May 2010
ACM SIGKDD Explorations Newsletter | VOL. 11

Hinted Star Coordinates for Mixed Data
J Matute ... L Linsen
Computer Graphics Forum | VOL. 39
J Matute, et. al.J Matute ... L Linsen
15 May 2019
Computer Graphics Forum | VOL. 39

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Optimization of the Numeric and Categorical Attribute Weights in KAMILA Mixed Data Clustering Algorithm

Abstract

Talk to us

Similar Papers