Abstract

This paper proposes a novel k-means algorithm for clustering analysis for the cases that the true number of clusters in a data or points set is not known in advance. That is, assuming that the number of seed-points in the algorithm is set to be larger than the true number k of clusters in the data set, the proposed algorithm can assign the k seed-points to the actual clusters, respectively, with the extra seed-points corresponding to the empty clusters, i.e., having no winning points according to a newly defined distance. Via using the Mahalanobis distance, the proposed algorithm can be further extended to elliptical clustering analysis. It is demonstrated well by the experiments on simulated data set and the wine data that the proposed k- means algorithm can find the correct number of clusters in the sample data with a good correct classification rate. Moreover, the algorithm is successfully applied to unsupervised color image segmentation.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call