A PROPOSED ALGORITHM FOR DETERMINING THE OPTIMAL NUMBER OF CLUSTERS

Markela Muca ,Gleda Kutrolli ,Maksi Kutrolli

doi:10.19044/esj.2015.v11n36p%p

Abstract

Data clustering is a data exploration technique that allows objects with similar characteristics to be grouped together in order to facilitate their further processing. The K-means algorithm is a popular data-clustering algorithm. However, one of its drawbacks is the requirement for the number of clusters, K, to be specified before the algorithm is applied. This paper first reviews existing methods for selecting the number of clusters for the algorithm. Factors that affect this selection are then discussed and an improvement of the existing k-means algorithm to assist the selection is proposed. The paper concludes with an analysis of the results of using cluster validation referring to some measures that are classified as internal and external indexes to determine the optimal number of clusters for the K-means algorithm. There are applied some stopping criterion referring to those indexes for evaluating a clustering against a gold standart.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A PROPOSED ALGORITHM FOR DETERMINING THE OPTIMAL NUMBER OF CLUSTERS

Abstract

Talk to us

Similar Papers

More From: European Scientific Journal ESJ

Lead the way for us

Journal: European Scientific Journal ESJ	Publication Date: Dec 30, 2015
Citations: 1

Similar Papers

Dynamic parallel K-Means Algorithm Based On Dunn’s Index Method
Hitesh Kumari Yadav ... Sunil Dhankar
International Journal Of Engineering And Computer Science | VOL. 5
Hitesh Kumari Yadav, et. al.Hitesh Kumari Yadav ... Sunil Dhankar
29 Feb 2016
International Journal Of Engineering And Computer Science | VOL. 5

Development and validation of consensus clustering-based framework for brain segmentation using resting fMRI.
Srikanth Ryali ... Weidong Cai
Journal of neuroscience methods | VOL. 240
Srikanth Ryali, et. al.Srikanth Ryali ... Weidong Cai
29 Nov 2014
Journal of neuroscience methods | VOL. 240

Optimization of the number of clusters: a case study on multivariate quality control results of segment installation
S Mohammad E Hosseininasab ... Mohammad Javad Ershadi
The International Journal of Advanced Manufacturing Technology | VOL. 64
S Mohammad E Hosseininasab, et. al.S Mohammad E Hosseininasab ... Mohammad Javad Ershadi
15 Mar 2012
The International Journal of Advanced Manufacturing Technology | VOL. 64

How many clusters are best? - An experiment
Richard C Dubes
Pattern Recognition | VOL. 20
Richard C DubesRichard C Dubes
01 Jan 1987
Pattern Recognition | VOL. 20

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A PROPOSED ALGORITHM FOR DETERMINING THE OPTIMAL NUMBER OF CLUSTERS

Abstract

Talk to us

Similar Papers

More From: European Scientific Journal ESJ