A novel approach for initializing the spherical K-means clustering algorithm

Rehab Duwairi,Mohammed Abu-Rahmeh

doi:10.1016/j.simpat.2015.03.007

Abstract

In this paper, a novel approach for initializing the spherical K-means algorithm is proposed. It is based on calculating well distributed seeds across the input space. Also, a new measure for calculating vectors’ directional variance is formulated, to be used as a measure of clusters’ compactness. The proposed initialization scheme is compared with the classical K-means – where initial seeds are specified randomly or arbitrarily – on two datasets. The assessment was based on three measures: an objective function that measures intra cluster similarity, cluster compactness and time to converge. The proposed algorithm (called initialized K-means) outperforms the classical (random) K-means when intra cluster similarity and cluster compactness were considered for several values of k (number of clusters). As far as convergence time is concerned, the initialized K-means converges faster than the random K-means for small number of clusters. For a large number of clusters the time necessary to calculate the initial clusters’ seeds start to outweigh the convergence criterion in time. The exact number of clusters at which the proposed algorithm starts to change behavior is data dependent (=11 for dataset1 and=15 for dataset2).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A novel approach for initializing the spherical K-means clustering algorithm

Abstract

Talk to us

Similar Papers

More From: Simulation Modelling Practice and Theory

Lead the way for us

Journal: Simulation Modelling Practice and Theory	Publication Date: Apr 6, 2015
Citations: 92

Similar Papers

Extracting True Number of Clusters for Segmenting Image through Adaptive Finite Gaussian Mixture Model
...
VFAST Transactions on Software Engineering | VOL. 7
, et. al. ...
01 Feb 2019
VFAST Transactions on Software Engineering | VOL. 7

Finding the Number of Clusters in a Dataset Using an Information Theoretic Hierarchical Algorithm
M Aghagolzadeh ... B N Araabi
-
M Aghagolzadeh, et. al.M Aghagolzadeh ... B N Araabi
01 Dec 2006
01 Dec 2006

Object-based cluster validation with densities
Behnam Tavakkol ... Susan L Albin
Pattern Recognition | VOL. 121
Behnam Tavakkol, et. al.Behnam Tavakkol ... Susan L Albin
04 Aug 2021
Pattern Recognition | VOL. 121

Evidential evolving C-means clustering method based on artificial bee colony algorithm with variable strings and interactive evaluation mode
Zhi-Gang Su ... Hong-Yu Zhou
Fuzzy Optimization and Decision Making | VOL. 20
Zhi-Gang Su, et. al.Zhi-Gang Su ... Hong-Yu Zhou
06 Oct 2020
Fuzzy Optimization and Decision Making | VOL. 20

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A novel approach for initializing the spherical K-means clustering algorithm

Abstract

Talk to us

Similar Papers

More From: Simulation Modelling Practice and Theory