Abstract

The spectral clustering algorithm is extensively employed in different aspects, especially in the field of pattern recognition. However, the efficient construction of the neighborhood graph is the main reason for its promising results. Generally, the similarity matrix relies on the applied similarity measure between two data points, selection of k−nearest neighbors (KNN), and approach for the construction of a neighborhood graph. In this study, we integrate S-distance to spectral clustering, which is capable to find out the complex and non-linear cluster structures. Moreover, generalized mean distance-based KNN is proposed to decrease the sensitiveness towards the value of the k. Also, a symmetry-favored KNN method is applied to construct the neighborhood graph, which reduces the impact of outliers and noisy data points. However, spectral clustering faces scalability and speedup issues in the case of large size datasets. Thus, the proposed spectral clustering algorithm is also executed in distributed environments. Several experiments are performed to validate the proposed clustering algorithm on 20 real-world datasets and 3 large size datasets. Experimental results demonstrate that the proposed clustering algorithm outperforms some of the baseline methods in terms of accuracy and clustering error rates. Finally, we conduct Wilcoxon’s Rank-Sum test and illustrate that the proposed spectral clustering algorithm is statistically significant.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.