An efficient distance estimation and centroid selection based on k-means clustering for small and large dataset

Girdhar Gopal Ladha, Ravi Kumar Singh Pippal

doi:10.19101/ijatee.2020.762109

Abstract

This paper presents an efficient approach to distance estimation and centroid selection based on k-means clustering for small and large datasets. The dataset was pre-processed first; the PIMA Indian diabetes dataset was used for the complete study and analysis. After pre-processing, distance and centroid estimation was performed: initial centroids were selected at random and then updated until the determined number of iterations, or epochs, was reached. The distance measures used are Euclidean distance (Ed), Pearson coefficient distance (PCd), Chebyshev distance (Csd), and Canberra distance (Cad). The results indicate that all the distance measures performed reasonably well at clustering, but in terms of time Cad outperforms the other measures.
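The procedure described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the `epochs` and `seed` parameters, the use of `1 - r` as the Pearson distance, the handling of zero denominators, and the mean-based centroid update for every distance measure are all choices made here for illustration, since the abstract does not specify them.

```python
import math
import random


def euclidean(a, b):
    """Euclidean distance (Ed)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def pearson_distance(a, b):
    """Pearson coefficient distance (PCd), taken here as 1 - r (assumption)."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        # Correlation is undefined for a constant vector; 1.0 is an assumption.
        return 1.0
    return 1.0 - cov / math.sqrt(var_a * var_b)


def chebyshev(a, b):
    """Chebyshev distance (Csd): largest coordinate-wise difference."""
    return max(abs(x - y) for x, y in zip(a, b))


def canberra(a, b):
    """Canberra distance (Cad); terms with a zero denominator are skipped."""
    total = 0.0
    for x, y in zip(a, b):
        denom = abs(x) + abs(y)
        if denom > 0:
            total += abs(x - y) / denom
    return total


def kmeans(points, k, distance, epochs=20, seed=0):
    """k-means with random initial centroids and a fixed number of epochs."""
    rng = random.Random(seed)
    # Initial selection based on randomization: pick k distinct data points.
    centroids = [list(p) for p in rng.sample(points, k)]
    labels = [0] * len(points)
    for _ in range(epochs):
        # Assignment step: each point goes to its nearest centroid.
        labels = [
            min(range(k), key=lambda j: distance(p, centroids[j]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        # Using the mean for non-Euclidean distances is a heuristic (assumption).
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # an empty cluster keeps its previous centroid
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    # labels reflect the assignment made in the final epoch.
    return centroids, labels
```

A call such as `kmeans(rows, k=2, distance=canberra)` would cluster a list of numeric feature tuples, where `rows` is a hypothetical name for the pre-processed records. Passing the distance as a function makes it straightforward to time the four measures against each other on the same data, although the result depends on the random initial centroids.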


Journal: International Journal of Advanced Technology and Engineering Exploration
Publication Date: Dec 31, 2020
Citations: 2

Similar Papers

Iris recognition based on distance similarity and PCA
Yuslena Sari ... Muhammad Alkaff
01 Jan 2018

Some Intuitionist Fuzzy Weighted Geometric Distance Measures and Their Application to Group Decision Making
Bo Peng ... Shouzhen Zeng
International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems | VOL. 22
01 Oct 2014

Clustering Performance Analysis of the K-Medoids Algorithm for Improved Fingerprint-Based Localization
Abdulmalik Yaro ... Maly Filip
Jordan Journal of Electrical Engineering | VOL. 10
01 Jan 2024

Comparison of Distance Metrics for Generating Cluster-based Ensemble Learning
Lenny Putri Yulianti ... Judhi Santoso
23 Feb 2023
