Abstract

Analyzing the fast search and find of density peaks clustering (DPC) algorithm, we find that the cluster centers cannot be determined automatically and that the selected cluster centers may fall into a local optimum and the random selection of the parameter cut-off distance ${d_{c}}$ value. To overcome these problems, a novel clustering algorithm based on DPC & PSO (PDPC) is proposed. Particle swarm optimization (PSO) is introduced because of its simple concept and strong global search ability, which can find the optimal solution in relatively few iterations. First, to solve the effect of the selection of the parameter ${d_ {c}} $ on the calculation density and the clustering results, this paper proposes a method to calculate that parameter. Second, a new fitness criterion function is proposed that iteratively searches ${K} $ global optimal solutions through the PSO algorithm, that is, the initial cluster centers. Third, each sample is assigned to ${K} $ initial center points according to the minimum distance principle. Finally, we update the cluster centers and redistribute the remaining objects to the clusters closest to the cluster centers. Furthermore, the effectiveness of the proposed algorithm is verified on nine typical benchmark data sets. The experimental results show that the PDPC can effectively solve the problem of cluster center selection in the DPC algorithm, avoiding the subjectivity of the manual selection process and overcoming the influence of the parameter ${d_{c}}$ . Compared with the other six algorithms, the PDPC algorithm has a stronger global search ability, higher stability and a better clustering effect.

Highlights

  • In 2014, Alex Rodriguez et al proposed a new algorithm, the clustering by fast search and find of density peaks (DPC) algorithm [1]

  • A novel clustering algorithm based on density peaks clustering (DPC) & Particle swarm optimization (PSO) (PDPC) is proposed

  • When calculating the density of data points, dc does not need to be randomly selected according to the empirical value; (2) this study introduces the PSO intelligent optimization algorithm because it has strong global search ability, which prevents the cluster centers selected by the DPC algorithm from falling into a local optimum; (3) a new fitness function is proposed based on the DPC algorithm, which iteratively searches K global optimal solutions by the PSO algorithm, that is, the initial cluster centers

Read more

Summary

INTRODUCTION

In 2014, Alex Rodriguez et al proposed a new algorithm, the clustering by fast search and find of density peaks (DPC) algorithm [1]. The advantages of the DPC clustering algorithm are outstanding, but its disadvantages are obvious. A new fitness function based on the DPC algorithm is proposed, and a method for calculating the parameter dc is proposed. On these bases, a novel clustering algorithm based on DPC & PSO (PDPC) is proposed. The effectiveness and advantages of the PDPC algorithm are verified by experiments on typical benchmark data sets. Experiments show that our algorithm can effectively solve the problem of cluster center selection in the DPC algorithm, avoiding the subjectivity of the manual selection process and overcoming the influence of the parameter dc

MOTIVATIONS The motivation of this study can be summarized as follows:
RELATED WORKS
DENSITY PEAK CLUSTERING ALGORITHM
PDPC CLUSTERING ALGORITHM
SETTING THE PARAMETER
EXPERIMENTAL RESULTS AND DISCUSSION
TEST CONVERGENCE OF THE PDPC ALGORITHM
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call