ANDClust: An Adaptive Neighborhood Distance‐Based Clustering Algorithm to Cluster Varying Density and/or Neck‐Typed Datasets

Ali Şenol

doi:10.1002/adts.202301113

Abstract

Although density‐based clustering algorithms can successfully define clusters of arbitrary shape, they encounter issues if the dataset has varying densities or neck‐typed clusters, because they require precise distance parameters, such as the eps parameter of DBSCAN. These approaches assume that data density is homogeneous, but this is rarely the case in practice. In this study, a new clustering algorithm named ANDClust (Adaptive Neighborhood Distance‐based Clustering Algorithm) is proposed to handle datasets with varying density and/or neck‐typed clusters. The algorithm consists of three parts. The first part uses Multivariate Kernel Density Estimation (MulKDE) to find the dataset's peak points, which serve as the start points from which a Minimum Spanning Tree (MST) constructs clusters in the second part. Lastly, an Adaptive Neighborhood Distance (AND) ratio is used to weigh the distance between data pairs. This ratio lets the approach support inter‐cluster and intra‐cluster density variations by acting as if the distance parameter differed for each data point in the dataset. ANDClust is tested on synthetic and real datasets to assess its efficiency. The algorithm shows superior clustering quality in a reasonable run time compared to its competitors. Moreover, ANDClust can effectively define clusters of arbitrary shape and process high‐dimensional, imbalanced datasets that may contain outliers.
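The three parts described in the abstract can be illustrated with a rough sketch. This is not the published ANDClust procedure: the abstract does not define the AND ratio, so the sketch substitutes an assumed stand-in (an edge is accepted only if it is at most `ratio` times the smaller of the two endpoints' mean k-nearest-neighbor distances). The function name `sketch_cluster` and the parameters `k` and `ratio` are hypothetical, and SciPy's `gaussian_kde` stands in for MulKDE.

```python
import numpy as np
from scipy.stats import gaussian_kde
from scipy.spatial.distance import cdist


def sketch_cluster(X, k=5, ratio=3.0):
    """Illustrative sketch only; NOT the ANDClust algorithm from the paper."""
    n = len(X)
    # Part 1 (assumed form): kernel density estimate evaluated at every point.
    density = gaussian_kde(X.T)(X.T)
    # Pairwise distances and a per-point local scale (mean distance to k neighbors).
    D = cdist(X, X)
    local = np.sort(D, axis=1)[:, 1:k + 1].mean(axis=1)

    labels = np.full(n, -1)
    cluster = 0
    while (labels == -1).any():
        # Start each cluster at the densest point that is still unassigned.
        free = np.flatnonzero(labels == -1)
        seed = free[np.argmax(density[free])]
        labels[seed] = cluster
        # Part 2: grow a spanning tree Prim-style from the seed.
        while True:
            members = np.flatnonzero(labels == cluster)
            free = np.flatnonzero(labels == -1)
            if free.size == 0:
                break
            sub = D[np.ix_(members, free)]
            i, j = np.unravel_index(np.argmin(sub), sub.shape)
            u, v = members[i], free[j]
            # Part 3 (hypothetical stand-in for the AND ratio): the shortest
            # outgoing edge must be short relative to both endpoints' local scale.
            if sub[i, j] > ratio * min(local[u], local[v]):
                break
            labels[v] = cluster
        cluster += 1
    return labels


# Toy data: a tight 5x5 grid (spacing 0.1) and a loose 5x5 grid (spacing 1.0).
g = np.arange(5.0)
grid = np.array([(a, b) for a in g for b in g])
X = np.vstack([0.1 * grid, grid + 20.0])
labels = sketch_cluster(X)
```

Because the acceptance threshold scales with each point's own neighborhood, the same `ratio` serves both the tight and the loose grid, which is the behavior the abstract attributes to a per-point distance parameter. The growth loop rescans all member-to-free distances on every step, so the sketch is only suitable for small inputs.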

Journal: Advanced Theory and Simulations	Publication Date: Mar 8, 2024
License type: CC BY-NC 4.0

