A novel data clustering algorithm using heuristic rules based on [formula omitted]-nearest neighbors chain

Jianyun Lu,Qingsheng Zhu,Quanwang Wu

doi:10.1016/j.engappai.2018.03.014

Abstract

In practice, clustering algorithms usually suffer from the complex structure of the dataset, including data distribution and dimensionality. Meanwhile, the number of clusters, which is required as an input, is usually unavailable. In this paper, we propose a novel data clustering algorithm: it uses heuristic rules based on k-nearest neighbors chain and does not require the number of clusters as the input parameter. Inspired by the PageRank algorithm, we first use random walk model to measure the importance of data points. Then, on the basis of the important data points, we build a K-Nearest Neighbors Chain (KNNC) to order the k nearest neighbors by distance and propose two heuristic rules to find the proper number of clusters and initial clusters. The first heuristic rule is the gap of KNNC which reflects the degree of separation of clusters with convex shapes and the second one is the nearest neighbor gap of KNNC which reflects the inner compactness of a cluster. Comprehensive comparison results on synthetic and real datasets indicate that the proposed clustering algorithm can find the proper number of clusters and achieve comparable or even better performance than the popular clustering algorithms.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A novel data clustering algorithm using heuristic rules based on [formula omitted]-nearest neighbors chain

Abstract

Talk to us

Similar Papers

More From: Engineering Applications of Artificial Intelligence

Lead the way for us

Journal: Engineering Applications of Artificial Intelligence	Publication Date: Apr 18, 2018
Citations: 25

Similar Papers

A fast implementation of the ISOCLUS algorithm
N Memarsadeghi ... D.M Mount
-
N Memarsadeghi, et. al.N Memarsadeghi ... D.M Mount
21 Jul 2003
21 Jul 2003

An Improved Data Clustering Algorithm for Mining Web Documents
O H Odukoya ... G A Aderounmu
-
O H Odukoya, et. al.O H Odukoya ... G A Aderounmu
01 Dec 2010
01 Dec 2010

APs’ Virtual Positions-Based Reference Point Clustering and Physical Distance-Based Weighting for Indoor Wi-Fi Positioning
Weixing Xue ... Baoding Zhou
IEEE Internet of Things Journal | VOL. 5
Weixing Xue, et. al.Weixing Xue ... Baoding Zhou
01 Aug 2018
IEEE Internet of Things Journal | VOL. 5

An efficient clustering algorithm based on the k-nearest neighbors with an indexing ratio
Raneem Qaddoura ... Hossam Faris
International Journal of Machine Learning and Cybernetics | VOL. 11
Raneem Qaddoura, et. al.Raneem Qaddoura ... Hossam Faris
18 Nov 2019
International Journal of Machine Learning and Cybernetics | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A novel data clustering algorithm using heuristic rules based on [formula omitted]-nearest neighbors chain

Abstract

Talk to us

Similar Papers

More From: Engineering Applications of Artificial Intelligence