Abstract

Spectral clustering is a group of graph-based clustering methods in which the columns of the scaled cluster indicator matrix can be obtained by stacking the eigenvectors of the Laplacian matrix corresponding to the top c smallest eigenvalues ( c is the number of clusters). This leads to the possible existence of negative values in the scaled indicator matrix and therefore a post-processing step such as K means clustering or spectral rotation is necessary to get the discrete cluster assignments. Moreover, such obtained results lack of the interpretability for data points in the boundary area of multiple clusters. To simultaneously address both limitations, we propose a two-stage clustering model, termed FGC (fuzzy graph clustering) in this paper. In FGC, we first construct a doubly stochastic graph affinity matrix which is then approximated by the scaled product of the fuzzy cluster indicator matrices. The newly designed fuzzy cluster indicator matrix has two desirable properties of non-negativity and row normalization, which can bring us two benefits. On one hand, we can directly get the cluster assignment of a certain data point by checking the largest value in the corresponding row of the fuzzy cluster indicator matrix; and on the other hand, we can obtain the membership of each data point to different clusters. An iterative method under the alternative optimization framework is proposed to solve the objective function of FGC. We conduct data clustering experiments on both synthetic and benchmark data sets and the results demonstrate the effectiveness of our proposed FGC model.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call