Clusterization by the K-means method when K is unknown

Natalya Litvinenko,Mussa Turdalyuly,Assem Shayakhmetova,Orken Mamyrbayev,N Bardis

doi:10.1051/itmconf/20192401013

Natalya Litvinenko, Mussa Turdalyuly + Show 3 more

Open Access

https://doi.org/10.1051/itmconf/20192401013

Copy DOI

Abstract

There are various methods of objects’ clusterization used in different areas of machine learning. Among the vast amount of clusterization methods, the K-means method is one of the most popular. Such a method has as pros as cons. Speaking about the advantages of this method, we can mention the rather high speed of objects clusterization. The main disadvantage is a necessity to know the number of clusters before the experiment. This paper describes the new way and the new method of clusterization, based on the K-means method. The method we suggest is also quite fast in terms of processing speed, however, it does not require the user to know in advance the exact number of clusters to be processed. The user only has to define the range within which the number of clusters is located. Besides, using suggested method there is a possibility to limit the radius of clusters, which would allow finding objects that express the criteria of one cluster in the most distinctive and accurate way, and it would also allow limiting the number of objects in each cluster within the certain range.

Highlights

Nowadays artificial intelligence is a very popular tool in various fields of science - economics, public life and production
Bayesian networks are widely used in such areas as economics, psychology, sociology, medicine, genetics, management theory, etc
We have described two types of the Bayesian networks’ learning below: Probabilistic aspects of machine learning and the basis of various algorithms used in machine learning are described in [9]

Summary

Introduction

Nowadays artificial intelligence is a very popular tool in various fields of science - economics, public life and production. Sometimes researcher may have some doubts about the necessity of determining Bayesian networks’ separate nodes individual nodes of a Bayesian network These problems are solved by involving specialists in the studied area. The processing of big amount of data in the construction of Bayesian networks significantly complicates an already difficult task This implies new problems in the computational area but in mathematics. Under Bayesian networks’ controlled learning we usually understood as different ways to determine the probabilistic characteristics of the separate nodes’ variables of the network, as well as the probabilistic dependencies between separate nodes based on some array of experimental data. Under Bayesian networks’ uncontrolled learning we usually understood methods of defining new nodes of the network and new dependencies between nodes based on some array of experimental data. We assume to apply this software to grant project «Development and software implementation of a package for solving applied problems in Bayesian networks»

Problem statement

Stages of clusterization algorithm

Findings

Conclusions

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: ITM Web of Conferences	Publication Date: Jan 1, 2019
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Clusterization by the K-means method when K is unknown

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: ITM Web of Conferences

Lead the way for us

Similar Papers

Optimization of the Number of Clusters of the K-Means Method in Grouping Egg Production Data in Indonesia
Solikhun Solikhun ... Verdi Yasin
International Journal of Artificial Intelligence & Robotics (IJAIR) | VOL. 4
Solikhun Solikhun, et. al.Solikhun Solikhun ... Verdi Yasin
02 Jun 2022
International Journal of Artificial Intelligence & Robotics (IJAIR) | VOL. 4

Automatic Classification of Time Series (ACTS): A new clustering method for remote sensing time series
N Viovy
International Journal of Remote Sensing | VOL. 21
N ViovyN Viovy
01 Jan 1999
International Journal of Remote Sensing | VOL. 21

Classification of Annual Precipitations and Identification of Homogeneous Regions using K-Means Method
...
Teknik Dergi | VOL. 23
, et. al. ...
01 Aug 2012
Teknik Dergi | VOL. 23

Fast automatic estimation of the number of clusters from the minimum inter-center distance for k-means clustering
Avisek Gupta ... Swagatam Das
Pattern recognition letters | VOL. 116
Avisek Gupta, et. al.Avisek Gupta ... Swagatam Das
13 Sep 2018
Pattern recognition letters | VOL. 116

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Clusterization by the K-means method when K is unknown

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: ITM Web of Conferences