Aiming at the defects of traditional density-based uncertainty clustering algorithms, such as parameter sensitivity and poor clustering results for complex manifold uncertain data sets, a new uncertainty data density peak clustering algorithm based on JS divergence (UDPC-JS) is proposed. The algorithm first removes noise points by using the uncertain natural neighbor density factor defined by the uncertain natural neighbor. Secondly, the local density of uncertain data objects is calculated by combining the uncertain natural neighbor and JS divergence. The initial clustering center of the uncertain data set is found by combining the idea of representative points, and the distance between the initial clustering centers is defined based on JS divergence and graph. Then, the decision graph is constructed on the initial clustering center using the local density calculated based on the uncertain natural neighbor and JS divergence and the newly defined distance between the initial clustering centers based on JS divergence and graph, and the final clustering center is selected according to the decision graph. Finally, the unassigned uncertain data objects are assigned to the cluster where their initial clustering center is located. Experimental results show that the algorithm has better clustering effect and accuracy than the comparison algorithm, and has a greater advantage in processing complex manifold uncertain data sets.
Read full abstract