Abstract

Fuzzy cluster analysis groups objects into fuzzy clusters based on their similarities and is essential when the boundaries between clusters in the data are unclear. In this paper, we propose a new method for fuzzy clustering of data with categorical attributes. Specifically, we first introduce a kernel-based representation of cluster centers, in which the underlying distribution of categorical values within a cluster center is estimated as a weighted sum of the uniform distribution and the frequency distribution of those values. We then extend the k-centers clustering method by applying this cluster center representation to fuzzy clustering of categorical data. The effectiveness and efficiency of the proposed method are demonstrated through experiments on 16 real-world datasets and comparison with existing methods. In addition, our research can be regarded as the first attempt to apply a fuzzy silhouette score, which accounts for both the internal coherence and the external separation of fuzzy clusters, to the clustering of categorical data.
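The cluster center representation described above can be illustrated for a single categorical attribute. The following is a minimal sketch, not the paper's implementation: the function name `kernel_center`, the smoothing weight `lam`, and the optional fuzzy membership `weights` are all assumptions introduced here, and the abstract does not say how the mixing weight is chosen.

```python
def kernel_center(values, domain, lam, weights=None):
    """Mix the uniform distribution with the (weighted) frequency distribution.

    P(v) = lam * (1 / |domain|) + (1 - lam) * freq(v)

    values  : observed categories of the objects in the cluster
    domain  : all possible categories of the attribute (must contain every value)
    lam     : hypothetical mixing weight in [0, 1]; how it is set is an assumption
    weights : optional fuzzy membership degrees, one per object (assumption)
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")
    if weights is None:
        weights = [1.0] * len(values)
    if len(weights) != len(values):
        raise ValueError("weights and values must have the same length")

    # Accumulate membership-weighted counts per category.
    counts = {v: 0.0 for v in domain}
    for v, w in zip(values, weights):
        counts[v] += w

    total = sum(weights)
    uniform = 1.0 / len(domain)
    center = {}
    for v in domain:
        # Fall back to the uniform distribution for an empty cluster.
        freq = counts[v] / total if total > 0 else uniform
        center[v] = lam * uniform + (1.0 - lam) * freq
    return center


center = kernel_center(["a", "a", "b"], ["a", "b", "c"], lam=0.3)
```

With `lam = 0` the center reduces to the plain frequency distribution, and with `lam = 1` it becomes uniform. Any intermediate value gives unseen categories (here `"c"`) a nonzero probability while the probabilities still sum to one.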
