A hybrid clustering technique combining a novel genetic algorithm with K-Means

Md Anisur Rahman, Md Zahidul Islam

doi:10.1016/j.knosys.2014.08.011

Abstract

Many existing clustering techniques, including K-Means, require the user to supply the number of clusters, yet it is often extremely difficult for a user to estimate that number accurately for a given data set. Genetic algorithms (GAs) generally determine the number of clusters automatically. However, they typically choose the genes and the number of genes randomly. If the right genes can be identified in the initial population, a GA is more likely to produce a high-quality clustering result than when the genes are chosen randomly. We propose a novel GA-based clustering technique that is capable of automatically finding the right number of clusters and identifying the right genes through a novel initial population selection approach. With the help of our novel fitness function and gene rearrangement operation, it produces high-quality cluster centers. The centers are then fed into K-Means as initial seeds in order to produce an even higher quality clustering solution by allowing the initial seeds to readjust as needed. Our experimental results indicate a statistically significant superiority (according to sign test analysis) of our technique over five recent techniques on the twenty natural data sets used in this study, based on six evaluation criteria.
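
The final stage described in the abstract, handing cluster centers to K-Means as initial seeds so they can readjust, can be illustrated with a minimal sketch. This is not the authors' implementation: it is a plain Lloyd-style K-Means in standard-library Python, and the function name `kmeans_from_seeds`, the toy `points`, and the `ga_seeds` values (a hypothetical stand-in for GA-produced centers) are all assumptions made for illustration. The GA itself, its fitness function, and the gene rearrangement operation are not reproduced here.

```python
import math

def kmeans_from_seeds(points, seeds, max_iter=100):
    """Lloyd-style K-Means started from caller-supplied seeds (illustrative only)."""
    centers = [tuple(s) for s in seeds]
    labels = []
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest current center.
        labels = [
            min(range(len(centers)), key=lambda j: math.dist(p, centers[j]))
            for p in points
        ]
        # Update step: move each center to the mean of its assigned points.
        new_centers = []
        for j, old in enumerate(centers):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                new_centers.append(
                    tuple(sum(coord) / len(members) for coord in zip(*members))
                )
            else:
                new_centers.append(old)  # an empty cluster keeps its seed
        if new_centers == centers:
            break  # centers stopped moving, so the labels are final
        centers = new_centers
    return centers, labels

# Toy data: two well-separated groups in 2-D (made up for this example).
points = [(1.0, 1.0), (1.2, 0.8), (0.8, 1.1), (5.0, 5.0), (5.2, 4.9), (4.8, 5.1)]
# Hypothetical stand-in for centers a GA stage might have produced.
ga_seeds = [(1.5, 1.5), (4.5, 4.5)]

centers, labels = kmeans_from_seeds(points, ga_seeds)
print(centers)
print(labels)
```

Because the number of seeds fixes the number of clusters, K-Means in this arrangement only refines the positions of centers chosen upstream; it does not change how many clusters there are.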

Journal: Knowledge-Based Systems
Publication Date: Aug 20, 2014
Citations: 202

Similar Papers

Genetic Algorithm with an Improved Initial Population Technique for Automatic Clustering of Low-Dimensional Data
Xiangbing Zhou ... Fang Miao
Information | VOL. 9
21 Apr 2018

Defect clustering and classification for semiconductor devices
B Kundu ... K.P White
04 Aug 2002

An unsupervised transfer learning approach to discover topics for online reputation management
Tamara Martín-Wanton ... Enrique Amigó
01 Jan 2013

Genetic algorithm with healthy population and multiple streams sharing information for clustering
A.H Beg ... Vladimir Estivill-Castro
Knowledge-Based Systems | VOL. 114
15 Oct 2016
