Effective Density Peaks Clustering Algorithm Based on the Layered K-Nearest Neighbors and Subcluster Merging

Chunhua Ren, Qishi Wu, Yang Yu, Linfu Sun

doi:10.1109/access.2020.3006069

Abstract

The density peaks clustering (DPC) algorithm is a novel density-based clustering algorithm that is simple and efficient, does not require the number of clusters to be specified in advance, and can find clusters of arbitrary nonspherical shape. However, DPC relies heavily on how the cutoff distance threshold and the local density are calculated, and it cannot analyze complex manifold data, especially datasets with uneven density distribution and multiple peaks in the same cluster. To solve these problems, we propose an improved density peaks clustering algorithm based on the layered k-nearest neighbors and subcluster merging (LKSM_DPC). First, we redefine the local density calculation method using the layered k-nearest neighbors: to adapt to datasets with different densities, the k-nearest neighbors are divided into multiple layers. Second, to handle multiple peaks in the same cluster, we design a new mechanism that calculates the similarity of subclusters based on the idea of shared neighbors and Newton's law of gravitation, and we propose a subcluster merging strategy. To demonstrate the effectiveness of our algorithm, we compare LKSM_DPC with K-means, DBSCAN, DPC, and DPC derivatives on 24 datasets. Extensive experiments show that our algorithm often outperforms the other algorithms.
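For readers unfamiliar with the baseline that the abstract criticizes, the following is a minimal sketch of the two quantities computed by the original DPC algorithm: a local density based on a cutoff distance, and the distance to the nearest point of higher density. It is not LKSM_DPC, whose layered density and merging formulas are not reproduced on this page. The function name `dpc_quantities`, the toy points, and the cutoff value `d_c=0.12` are illustrative assumptions.

```python
import math

def dpc_quantities(points, d_c):
    """Return (rho, delta) per point for baseline cutoff-kernel DPC.

    rho[i]   = number of other points closer to i than the cutoff d_c
    delta[i] = distance from i to the nearest point with strictly higher rho;
               for a point with no denser point, the largest distance from i.
    """
    n = len(points)
    dist = [[math.dist(p, q) for q in points] for p in points]
    rho = [sum(1 for j in range(n) if j != i and dist[i][j] < d_c)
           for i in range(n)]
    delta = []
    for i in range(n):
        higher = [dist[i][j] for j in range(n) if rho[j] > rho[i]]
        delta.append(min(higher) if higher else max(dist[i]))
    return rho, delta

# Toy data (assumed for illustration): two tight groups far apart.
points = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1),
          (5.0, 5.0), (5.1, 5.0), (5.0, 5.1)]
rho, delta = dpc_quantities(points, d_c=0.12)

# Candidate centers combine high density with a large delta; ranking by
# the product rho * delta is one common heuristic for picking them.
gamma = [r * d for r, d in zip(rho, delta)]
centers = sorted(sorted(range(len(points)), key=lambda i: -gamma[i])[:2])
```

Both `rho` and the resulting centers change with `d_c`, which is the sensitivity to the cutoff distance that the abstract points out.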

Highlights

Clustering is one of the most important techniques in data mining
Although the literature has contributed improvements to density peaks clustering (DPC), several problems remain: (1) most scholars have used the idea of the k-nearest neighbors to calculate the local density, but few have considered how these k points are distributed, especially when the data density distribution is uneven; and (2) in a 2-D decision graph, it is difficult to determine the real cluster center, especially when there are multiple peaks in a cluster
To solve the above problems, this paper proposes a novel density peaks clustering algorithm based on the layered k-nearest neighbors and subcluster merging (LKSM_DPC)

Summary

INTRODUCTION

Clustering is one of the most important techniques in data mining. It gathers data with similar characteristics into a cluster, such that different clusters differ significantly from one another [1], [2]. Cheng et al. [25] addressed the problem that DPC cannot process manifold datasets. They proposed an improved density peaks clustering algorithm based on shared neighbors between local cores (LORE-DP), which redefines a natural-neighbor-based density and introduces a newly defined graph-based distance. Although the literature has contributed improvements to DPC, several problems remain: (1) most scholars have used the idea of the k-nearest neighbors to calculate the local density, but few have considered how these k points are distributed, especially when the data density distribution is uneven; and (2) in a 2-D decision graph, it is difficult to determine the real cluster center, especially when there are multiple peaks in a cluster.
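To make the k-nearest-neighbor idea concrete, here is a small sketch of a kNN-based local density and of the shared-neighbor set of two points. The density form used, exp of the negative mean squared distance to the k nearest neighbors, is one common choice in kNN variants of DPC and is an assumption here; it is not the layered definition of LKSM_DPC, which this page does not give. The helper names and toy points are hypothetical.

```python
import math

def knn_indices(points, i, k):
    """Indices of the k nearest neighbors of point i (excluding i itself)."""
    others = (j for j in range(len(points)) if j != i)
    order = sorted(others, key=lambda j: math.dist(points[i], points[j]))
    return order[:k]

def knn_density(points, i, k):
    """Assumed kNN density: exp(-mean squared distance to the k neighbors)."""
    nbrs = knn_indices(points, i, k)
    mean_sq = sum(math.dist(points[i], points[j]) ** 2 for j in nbrs) / k
    return math.exp(-mean_sq)

def shared_neighbors(points, i, j, k):
    """Points that appear in the kNN lists of both i and j."""
    return set(knn_indices(points, i, k)) & set(knn_indices(points, j, k))

# Toy 1-D-like data (assumed): a dense group on the left, a sparse one on the right.
points = [(0.0, 0.0), (0.1, 0.0), (0.2, 0.0),
          (3.0, 0.0), (4.0, 0.0), (5.0, 0.0)]
dense = knn_density(points, 1, k=2)    # neighbors at distance about 0.1
sparse = knn_density(points, 4, k=2)   # neighbors at distance 1.0
common = shared_neighbors(points, 0, 2, k=2)
```

With the same k, the density value depends entirely on how far away the k neighbors lie, so a single flat summary over them can behave very differently in dense and sparse regions. This is the situation in which the distribution of the k points, rather than only their count, matters.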

DENSITY PEAKS CLUSTERING ALGORITHM

DENSITY PEAKS CLUSTERING BASED ON THE K-NEAREST NEIGHBORS

FUZZY WEIGHTED K-NEAREST NEIGHBORS DENSITY PEAKS CLUSTERING

OUR ALGORITHM

SIMILARITY AND SUBCLUSTER MERGING

EXPERIMENTS

Findings

CONCLUSIONS


Journal: IEEE access : practical innovations, open solutions
Publication Date: Jan 1, 2020
Citations: 16
License type: CC BY 4.0

Similar Papers

An improved density peaks clustering algorithm based on natural neighbor with a merging strategy
Shifei Ding ... Chao Li
Information sciences | VOL. 624 | 30 Dec 2022

Density Peak Clustering Algorithm Based on Optimal Density Radius
Yalu Liao ... Yaru Wang
01 Jul 2018

ConDPC: Data Connectivity-Based Density Peak Clustering
Yujuan Zou ... Zhijian Wang
Applied sciences | VOL. 12 | 13 Dec 2022

Density Peaks Clustering Based on Feature Reduction and Quasi-Monte Carlo
Zhihui Hu ... Jiangbo Qian
Scientific programming | VOL. 2022 | 06 Jan 2022
