Abstract

Clustering by fast search and find of density peaks (herein called FDPC), a recently proposed density-based clustering algorithm, has attracted the attention of many researchers because it can recognize arbitrarily shaped clusters. In addition, FDPC needs only one parameter, $$d_c$$, and identifies the number of clusters from a decision graph. Nevertheless, it is not clear how to find a proper $$d_c$$ for a given data set, and a single suitable value may not exist in practice for multi-scale data sets. In this paper, we propose a modified PageRank algorithm to compute the local density of each data point, which is more robust than the Gaussian kernel and cutoff methods. Moreover, FDPC yields poor results on randomly distributed data sets, since one cluster may contain several density maxima. To solve this problem, we propose an improved minimax similarity method. Experiments on artificial and real-life data sets indicate that the proposed approach outperforms FDPC in terms of accuracy.
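
For orientation, the baseline that the abstract refers to can be sketched as follows. This is a minimal illustration of the standard cutoff-density formulation of FDPC, not the modified PageRank density or the improved minimax similarity proposed in the paper. The function name `fdpc_scores` and the toy data in the usage note are assumptions introduced here for illustration only.

```python
import numpy as np

def fdpc_scores(points, d_c):
    """Return (rho, delta) for the cutoff-density variant of FDPC.

    rho[i]   : number of other points closer than d_c to point i.
    delta[i] : distance from point i to the nearest point of higher density;
               for a point with no denser point, the largest distance to any
               other point (the usual convention).
    """
    pts = np.asarray(points, dtype=float)
    # Pairwise Euclidean distances, shape (n, n).
    dist = np.linalg.norm(pts[:, None, :] - pts[None, :, :], axis=-1)
    # Count neighbours inside the cutoff; subtract 1 to exclude the point itself.
    rho = (dist < d_c).sum(axis=1) - 1
    n = len(pts)
    delta = np.empty(n)
    for i in range(n):
        higher = rho > rho[i]
        if higher.any():
            delta[i] = dist[i, higher].min()
        else:
            # No denser point exists. Ties at the top density all land here,
            # which is one way several maxima can appear.
            delta[i] = dist[i].max()
    return rho, delta
```

Calling `fdpc_scores(points, d_c)` on a small array of 2-D points returns the two quantities that are plotted against each other in the decision graph: points with both large `rho` and large `delta` are candidate cluster centers. The result depends directly on the choice of `d_c`, which is the sensitivity the abstract describes.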
