Abstract

K-modes is the scalable and efficient clustering algorithm to cluster the categorical data. The simple mismatching measure used in K-modes does not use the implicit relationship between the attribute values. This paper presents new weighted measures based on the domain of attribute values. The proposed measures were experimented with the datasets obtained from UCI data repository. External quality measures such as purity and F-measure is used to verify the efficiency of the clustering. The experimental results prove that the proposed measures are superior to original K-modes.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call