Abstract
The density peaks clustering (DPC) algorithm is not sensitive to the recognition of halo nodes. The halo nodes at the edge of the density peaks clustering algorithm has a lower local density. The outliers are distributed in halo nodes. The novel halo identification method based on density peaks clustering algorithm utilize the advantage of DBSCAN algorithm to quickly identify outliers, which improved the sensitivity to halo nodes. However, the identified halo nodes cannot be effectively assigned to adjacent clusters. Therefore, this paper will use K-nearest neighbor (KNN) algorithm to classify the identified halo nodes. K-nearest neighbor is the simplest and most efficient classification method. The KNN algorithm has the advantages of high accuracy, insensitivity to outliers and no input hypothesis data. Hence, we proposed a novel density peaks clustering halo node assignment algorithm based on K-nearest neighbor theory (KNN-HDPC). KNN-HDPC can grasp the internal relations between outliers and cluster nodes more deeply, so as to dig out the deeper relations between nodes. Experimental results demonstrate that the proposed algorithm can effectively cluster and reclassify a large number of complex data. We can quickly dig out the potential relationship between noise points and cluster points. The improved algorithm has higher clustering accuracy than the original DPC algorithm, and essentially has more robust clustering results.
Highlights
Clustering analysis can be effectively classified from sample data
In the original density peaks clustering (DPC) algorithm, this parameter was determined by 1 to 2 percent of the node distance sorted in ascending order
We can see from the simulation clustering process diagram that the new algorithm K-nearest neighbor (KNN)-HDPC can master the inherent similarity between halo nodes and adjacent categories
Summary
Clustering analysis can be effectively classified from sample data. Different methods used in clustering analysis often lead to different conclusions [1]. We proposed an improvement recognition method on halo node for density peaks clustering algorithm (HaloDPC). The density connection relationship in DBSCAN clustering and the conceptual model of structural similarity in SCAN algorithm are introduced, so that HaloDPC algorithm is more efficiently than the original DPC clustering. It improved the processing ability of the algorithm for arbitrary shape data sets and irregular data, and improved its ability to detect outliers and intermediate nodes. We proposed an improvement recognition method on halo node for density peaks clustering algorithm based on KNN theory (KNN-HDPC).
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.