A two-stage density clustering algorithm

Min Wang,Lei Gao,Fan Min,Li-Ping Deng,Ying-Yi Zhang

doi:10.1007/s00500-020-05028-x

Abstract

Clustering by fast search and find of density peaks (CFDP) is a popular density-based algorithm. However, it is criticized because it is inefficient and applicable only to some types of data, and requires the manual setting of the key parameter. In this paper, we propose the two-stage density clustering algorithm, which takes advantage of granular computing to address the aforementioned issues. The new algorithm is highly efficient, adaptive to various types of data, and requires minimal parameter setting. The first stage uses the two-round-means algorithm to obtain $$\sqrt{n}$$ small blocks, where n is the number of instances. This stage decreases the data size directly from n to $$\sqrt{n}$$ . The second stage constructs the master tree and obtains the final blocks. This stage borrows the structure of CFDP, while the cutoff distance parameter is not required. The time complexity of the algorithm is $$O(mn^\frac{3}{2})$$ , which is lower than $$O (mn^2)$$ for CFDP. We report the results of some experiments performed on 21 datasets from various domains to compare a new clustering algorithm with some state-of-the-art clustering algorithms. The results demonstrated that the new algorithm is adaptive to different types of datasets. It is two or more orders of magnitude faster than CFDP.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A two-stage density clustering algorithm

Abstract

Talk to us

Similar Papers

More From: Soft Computing

Lead the way for us

Journal: Soft Computing	Publication Date: May 26, 2020
Citations: 8

Similar Papers

A Hybrid Clustering Algorithm
Sheng-Yi Jiang ... Xia Li
-
Sheng-Yi Jiang, et. al.Sheng-Yi Jiang ... Xia Li
01 Jan 2009
01 Jan 2009

On a two-stage progressive clustering algorithm with graph-augmented density peak clustering
Xinzheng Niu ... Chase Q Wu
Engineering Applications of Artificial Intelligence | VOL. 108
Xinzheng Niu, et. al.Xinzheng Niu ... Chase Q Wu
08 Dec 2021
Engineering Applications of Artificial Intelligence | VOL. 108

Heterogeneous cryo-EM projection image classification using a two-stage spectral clustering based on novel distance measures.
Xiangwen Wang ... Xianghong Lin
Briefings in bioinformatics | VOL. 23
Xiangwen Wang, et. al.Xiangwen Wang ... Xianghong Lin
07 Mar 2022
Briefings in bioinformatics | VOL. 23

Research on batching strategy of medical orders based on Canopy-K-means two-stage clustering algorithm
Yufeng Zhuang ... Jingwen Han
-
Yufeng Zhuang, et. al.Yufeng Zhuang ... Jingwen Han
11 Sep 2020
11 Sep 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A two-stage density clustering algorithm

Abstract

Talk to us

Similar Papers

More From: Soft Computing