Effective Clustering Analysis Based on New Designed CVI and Improved Clustering Algorithms

Erzhou Zhu,Xuejun Li,Futian Wang,Feng Liu,Peng Wen,Binbin Zhu

doi:10.1109/bdcloud.2018.00115

Abstract

Due to different settings of the parameters and random selection of initial clustering centers, the traditional K-means algorithm is not stable. Clustering validity index (CVI) is an important method for evaluating the effect of clustering results generated by clustering algorithms. However, many of the existing CVIs suffer from instability, narrow range of applications and cannot properly process datasets with non-spherical distribution and datasets with a large number of overlapping points. Aiming at these problems, the traditional K-means algorithm is firstly improved by utilizing the dynamic average distance to find the initial clustering centers rather than selecting them randomly. Then, based on the idea of dynamic average distance, a new clustering validity index, DCVI, is proposed. The new DCVI is able to deal with many kinds of datasets includes non-convex datasets and datasets with a large number of overlapping points. Thirdly, by integrating the improved K-means algorithm with the new DCVI, a new algorithm (KVOA) is designed to optimize and determine the optimal clustering number (Kopt) for a wide range of datasets. The experimental results on testing several datasets have demonstrated that the improved K-means algorithm is more accurately and stably than the traditional ones. Meanwhile, the new DCVI is compared with six commonly used CVIs. The experimental results show that our new DCVI is more accurately and stably than the other CVIs.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Effective Clustering Analysis Based on New Designed CVI and Improved Clustering Algorithms

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

DP-Kmeans and Beyond: Optimal Clustering with a new Clustering Validity Index
Zhu-Juan Ma Zhu-Juan Ma ... Feng Liu Xiang-Hua Chen
電腦學刊 | VOL. 33
Zhu-Juan Ma Zhu-Juan Ma, et. al.Zhu-Juan Ma Zhu-Juan Ma ... Feng Liu Xiang-Hua Chen
01 Oct 2022
電腦學刊 | VOL. 33

Improved initial cluster center selection in K-means clustering
Minchen Zhu ... Steven D Prior
Engineering Computations | VOL. 31
Minchen Zhu, et. al.Minchen Zhu ... Steven D Prior
28 Oct 2014
Engineering Computations | VOL. 31

A Network Intrusion Detection Method Based on Improved K-means Algorithm
Meng Gao ... Nihong Wang
-
Meng Gao, et. al.Meng Gao ... Nihong Wang
24 Aug 2014
24 Aug 2014

The Preprocessing Method of Control Points in Geometric Correction for UAV Remote Sensing Image
Lirong Diao ... Tingting Chen
-
Lirong Diao, et. al.Lirong Diao ... Tingting Chen
01 Jan 2015
01 Jan 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Effective Clustering Analysis Based on New Designed CVI and Improved Clustering Algorithms

Abstract

Talk to us

Similar Papers