Clustering algorithms and validity measures

M Halkidi,M Vazirgiannis,Y Batistakis

doi:10.1109/ssdm.2001.938534

Abstract

Clustering aims at discovering groups and identifying interesting distributions and patterns in data sets. Researchers have extensively studied clustering since it arises in many application domains in engineering and social sciences. In the last years the availability of huge transactional and experimental data sets and the arising requirements for data mining created needs for clustering algorithms that scale and can be applied in diverse domains. The paper surveys clustering methods and approaches available in the literature in a comparative way. It also presents the basic concepts, principles and assumptions upon which the clustering algorithms are based. Another important issue is the validity of the clustering schemes resulting from applying algorithms. This is also related to the inherent features of the data set under concern. We review and compare clustering validity measures available in the literature. Furthermore, we illustrate the issues that are under-addressed by the recent algorithms and we address new research directions.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Clustering algorithms and validity measures

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

On Clustering Validation Techniques
Maria Halkidi ... Yannis Batistakis
Journal of Intelligent Information Systems | VOL. 17
Maria Halkidi, et. al.Maria Halkidi ... Yannis Batistakis
01 Jan 2001
Journal of Intelligent Information Systems | VOL. 17

A new fuzzy clustering algorithm for optimally finding granular prototypes
Ying Xie ... Xiaoquan Zhao
International Journal of Approximate Reasoning | VOL. 40
Ying Xie, et. al.Ying Xie ... Xiaoquan Zhao
07 Jan 2005
International Journal of Approximate Reasoning | VOL. 40

A Data Set Oriented Approach for Clustering Algorithm Selection
Maria Halkich ... Michalis Vazirgiannis
-
Maria Halkich, et. al.Maria Halkich ... Michalis Vazirgiannis
01 Jan 2001
01 Jan 2001

Towards understanding hierarchical clustering: A data distribution perspective
Junjie Wu ... Jian Chen
Neurocomputing | VOL. 72
Junjie Wu, et. al.Junjie Wu ... Jian Chen
07 Jan 2009
Neurocomputing | VOL. 72

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Clustering algorithms and validity measures

Abstract

Talk to us

Similar Papers