Variable Weighting in Fuzzy k-Means Clustering to Determine the Number of Clusters

Imran Khan,Waseem Shahzad,Zongwei Luo,Joshua Zhexue Huang

doi:10.1109/tkde.2019.2911582

Abstract

One of the most significant problems in cluster analysis is to determine the number of clusters in unlabeled data, which is the input for most clustering algorithms. Some methods have been developed to address this problem. However, little attention has been paid on algorithms that are insensitive to the initialization of cluster centers and utilize variable weights to recover the number of clusters. To fill this gap, we extend the standard fuzzy $k$ k -means clustering algorithm. It can automatically determine the number of clusters by iteratively calculating the weights of all variables and the membership value of each object in all clusters. Two new steps are added to the fuzzy $k$ k -means clustering process. One of them is to introduce a penalty term to make the clustering process insensitive to the initial cluster centers. The other one is to utilize a formula for iterative updating of variable weights in each cluster based on the current partition of data. Experimental results on real-world and synthetic datasets have shown that the proposed algorithm effectively determined the correct number of clusters while initializing the different number of cluster centroids. We also tested the proposed algorithm on gene data to determine a subset of important genes.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Variable Weighting in Fuzzy k-Means Clustering to Determine the Number of Clusters

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Knowledge and Data Engineering

Lead the way for us

Journal: IEEE Transactions on Knowledge and Data Engineering	Publication Date: Sep 1, 2020
Citations: 85

Similar Papers

Regionalisation of Watersheds Using Fuzzy C Means Clustering Algorithm in the West Flowing River of Kerala
Thottungal Krishnankutty Drissia ... Alayil Bahuleyan Anitha
-
Thottungal Krishnankutty Drissia, et. al.Thottungal Krishnankutty Drissia ... Alayil Bahuleyan Anitha
01 Jan 2020
01 Jan 2020

Determination of the Initialization Number of Clusters inK-means Clustering Application Using Co-OccurrenceStatistics Techniques for Multispectral Satellite Imagery
Kitti Koonsanit
International Journal of Information and Electronics Engineering | VOL. -
Kitti KoonsanitKitti Koonsanit
01 Jan 2012
International Journal of Information and Electronics Engineering | VOL. -

Automatically Determining the Number of Clusters in Unlabeled Data Sets
Liang Wang ... Christopher Leckie
IEEE Transactions on Knowledge and Data Engineering | VOL. 21
Liang Wang, et. al.Liang Wang ... Christopher Leckie
01 Mar 2009
IEEE Transactions on Knowledge and Data Engineering | VOL. 21

Modified K-means algorithm for automatic stimation of number of clusters using advanced visual assessment of cluster tendency
D Sharmilarani ... G Komarasamy
-
D Sharmilarani, et. al.D Sharmilarani ... G Komarasamy
01 Jan 2014
01 Jan 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Variable Weighting in Fuzzy k-Means Clustering to Determine the Number of Clusters

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Knowledge and Data Engineering