Efficient Density-Based Partitional Clustering Algorithm

Zareen Alamgir,Hina Naveed

doi:10.31577/cai_2021_6_1322

Abstract

Clustering is an important data mining technique that helps to detect hidden structures and patterns in the data. K-means algorithm is one of the most popular and widely used partitional clustering algorithms. It is a simple and efficient method but has several shortcomings. One major drawback of traditional K-means is that it selects initial centroids randomly, resulting in low-quality clusters. Various K-means extensions are designed to solve the issue of the random initial centroid. A novel density-based K-means (DK-means) algorithm is recently proposed that uses density-parameters for selecting initial centroids. It outperforms K-means in terms of accuracy at the cost of time. In this research, we present an efficient density-based K-means algorithm (EDK-means) that uses advance data structures and significantly reduces the DK-means algorithm's execution time. Furthermore, we rigorously evaluated the performance of density-based K-means on different challenging real-world datasets and compared it with traditional K-means. The experimental results are promising and show that density-based K-means outperforms K-means. It converges more rapidly than basic K-means, and it works well for the datasets with different cluster sizes.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Efficient Density-Based Partitional Clustering Algorithm

Abstract

Talk to us

Similar Papers

More From: Computing and Informatics

Lead the way for us

Journal: Computing and Informatics	Publication Date: Jan 1, 2021
Citations: 3

Similar Papers

An approach for document clustering using PSO and K-means algorithm
Rashmi Chouhan ... Anuradha Purohit
-
Rashmi Chouhan, et. al.Rashmi Chouhan ... Anuradha Purohit
01 Jan 2018
01 Jan 2018

A Density-Based k-Means++ Algorithm for Imbalanced Datasets Clustering
Linchuan Fan ... Yanxia Li
-
Linchuan Fan, et. al.Linchuan Fan ... Yanxia Li
08 Sep 2019
08 Sep 2019

Genetically Improved PSO Algorithm for Efficient Data Clustering
Rehab F Abdel-Kader
-
Rehab F Abdel-KaderRehab F Abdel-Kader
01 Jan 2009
01 Jan 2009

New Approach for K-mean and K-medoids Algorithm
Abhishek Patel ... Purnima Singh
International Journal of Computer Applications Technology and Research | VOL. 2
Abhishek Patel, et. al.Abhishek Patel ... Purnima Singh
10 Jan 2012
International Journal of Computer Applications Technology and Research | VOL. 2

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Efficient Density-Based Partitional Clustering Algorithm

Abstract

Talk to us

Similar Papers

More From: Computing and Informatics