K-means-G*: Accelerating k-means clustering algorithm utilizing primitive geometric concepts

Hassan Ismkhan, Mohammad Izadi

doi:10.1016/j.ins.2022.11.001

Abstract

K-means is the most popular clustering algorithm, but because it requires so many distance computations, its speed drops dramatically on high-dimensional data. Although some quite fast variants have been proposed in the literature, there is still much room for improvement on high-dimensional, large-scale datasets. The method proposed here, k-means-g*, is based on a simple geometric concept: for four distinct points, if the distances between all pairs except one are known, then a lower bound can be determined for the unknown distance. By applying this technique in the assignment step of k-means, many high-dimensional distance computations can be skipped at the cost of a small amount of memory. Both theoretical and experimental results confirm the speed of k-means-g* against recently published fast variants. In more than 50 of the 70 experimental cases, it is faster than the other algorithms. The C++ source code for k-means-g* is publicly available.
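
The four-point idea in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the paper's exact bound, so the formula below is an assumption: it is one standard Euclidean lower bound obtained by placing two reference points A and B on an axis and comparing the axial position and off-axis height of the other two points C and D. The names `four_point_lower_bound` and `nearest_center`, and the choice of the first two centers as reference points, are hypothetical and are not taken from the authors' C++ code.

```python
import math


def four_point_lower_bound(d_ab, d_ac, d_bc, d_ad, d_bd):
    """Lower bound on |CD| from the five known distances among A, B, C, D.

    Assumes Euclidean distances and d_ab > 0 (A and B are distinct).
    """
    def axis_coords(d_a, d_b):
        # Put A at 0 and B at d_ab on an axis. A point at distance d_a from A
        # and d_b from B has axial position x and off-axis height h.
        x = (d_a ** 2 - d_b ** 2 + d_ab ** 2) / (2.0 * d_ab)
        h = math.sqrt(max(d_a ** 2 - x ** 2, 0.0))
        return x, h

    xc, hc = axis_coords(d_ac, d_bc)
    xd, hd = axis_coords(d_ad, d_bd)
    # C and D are closest when their off-axis parts point the same way.
    return math.hypot(xc - xd, hc - hd)


def nearest_center(point, centers):
    """Return (index of the nearest center, number of skipped distances).

    Hypothetical assignment step: centers[0] and centers[1] act as the
    reference points A and B; needs at least two distinct centers.
    """
    a, b = centers[0], centers[1]
    d_ab = math.dist(a, b)
    d_pa, d_pb = math.dist(point, a), math.dist(point, b)
    best, best_d = (0, d_pa) if d_pa <= d_pb else (1, d_pb)
    skipped = 0
    for j in range(2, len(centers)):
        # Center-to-center distances do not depend on the point, so a real
        # implementation would compute them once per iteration and cache them.
        bound = four_point_lower_bound(
            d_ab, d_pa, d_pb,
            math.dist(a, centers[j]), math.dist(b, centers[j]),
        )
        if bound >= best_d:
            skipped += 1  # center j cannot beat the current best
            continue
        d = math.dist(point, centers[j])
        if d < best_d:
            best, best_d = j, d
    return best, skipped
```

In this sketch the point-to-center distance is computed only when the bound cannot rule the center out, which is the kind of saving the abstract describes for the assignment step. How k-means-g* actually selects its reference points and stores its bounds is not stated in the abstract.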

Journal: Information Sciences
Publication Date: Nov 7, 2022
Citations: 9
