Max stable set problem to found the initial centroids in clustering problem

Awatif Karim, Youssef Hami, Jaouad Boumhidi, Chakir Loqman

doi:10.11591/ijeecs.v25.i1.pp569-579

Abstract

In this paper, we propose a new approach to document clustering using the K-Means algorithm. K-Means is sensitive to the random selection of the k cluster centroids in the initialization phase. To evaluate the quality of K-Means clustering, we propose to model the text document clustering problem as the max stable set problem (MSSP) and to use a continuous Hopfield network to solve the MSSP and obtain the initial centroids. The idea is inspired by the fact that MSSP and clustering share the same principle: MSSP consists of finding the largest set of mutually disconnected nodes in a graph, and in clustering all objects are divided into disjoint clusters. Simulation results demonstrate that the proposed K-Means improved by MSSP (KM_MSSP) is efficient on large data sets, is well optimized in terms of running time, and provides better clustering quality than other methods.
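The idea of seeding K-Means from a stable set can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: documents are given as plain term-weight vectors, two documents are joined by an edge when their cosine similarity reaches a hypothetical `threshold`, and a simple greedy heuristic stands in for the continuous Hopfield network that the authors use to solve the MSSP. The function names are hypothetical as well.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def similarity_graph(vectors, threshold):
    """Adjacency sets: i and j are neighbours when they are 'similar enough'.

    The edge rule (cosine >= threshold) is an illustrative assumption.
    """
    n = len(vectors)
    adj = {i: set() for i in range(n)}
    for i in range(n):
        for j in range(i + 1, n):
            if cosine(vectors[i], vectors[j]) >= threshold:
                adj[i].add(j)
                adj[j].add(i)
    return adj


def greedy_stable_set(adj):
    """Greedy heuristic for a (maximal, not necessarily maximum) stable set.

    Repeatedly picks the remaining node with the fewest remaining neighbours,
    then discards that node and its neighbours. This is a stand-in for the
    continuous Hopfield network named in the abstract.
    """
    remaining = set(adj)
    stable = []
    while remaining:
        v = min(remaining, key=lambda x: (len(adj[x] & remaining), x))
        stable.append(v)
        remaining -= adj[v] | {v}
    return stable


def initial_centroids(vectors, k, threshold=0.8):
    """Use up to k mutually dissimilar documents as initial K-Means centroids.

    Returns fewer than k centroids if the stable set found is smaller than k.
    """
    adj = similarity_graph(vectors, threshold)
    stable = greedy_stable_set(adj)
    return [vectors[i] for i in stable[:k]]


if __name__ == "__main__":
    docs = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
    print(initial_centroids(docs, k=2))
```

Because no two nodes of a stable set share an edge, the selected documents are pairwise dissimilar under the chosen threshold, which is the property that makes them plausible starting centroids. The resulting vectors could then be passed to any K-Means implementation that accepts explicit initial centers.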

Journal: Indonesian Journal of Electrical Engineering and Computer Science
Publication Date: Jan 1, 2022
License type: CC BY-NC 4.0
