A comparative analysis of data sets using Machine Learning techniques

C B Abhilash, K Rohitaksha, Shankar Biradar

doi:10.1109/iadcc.2014.6779289

Abstract

Machine learning techniques are widely used for clustering data. K-means is one of the most widely used clustering algorithms and is easy to understand and to simulate on different datasets. In this paper we use the K-means algorithm to cluster the yeast and iris datasets; this clustering yielded lower accuracy and required more iterations. We also simulate an improved version of the K-means algorithm on these datasets. The Improved K-means algorithm uses a minimum spanning tree: an undirected graph is generated over all the input data points and the shortest distance is then calculated, which in turn results in better accuracy with fewer iterations. Both algorithms were simulated in the Java programming language, and their results were compared and analyzed. The algorithms were run several times with different numbers of cluster groups, and the analysis showed that the Improved K-means algorithm performs better than the K-means algorithm. The Improved K-means algorithm also showed that its accuracy increases as the number of clusters increases. We further infer from the results that the accuracy of the Improved K-means algorithm is optimal at a particular value of K (the number of cluster groups).
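
For orientation, the baseline K-means procedure referred to in the abstract can be sketched in Java roughly as follows. This is a minimal sketch of the standard assign-and-update loop, not the authors' implementation. The class name `KMeansSketch`, the `Result` holder, the choice to seed centroids with the first K points, and the toy data in `main` are all assumptions made for illustration. The minimum-spanning-tree step of the Improved K-means algorithm is not shown, because the abstract does not specify how the tree is used.

```java
import java.util.Arrays;

public class KMeansSketch {

    /** Holds the cluster label of each point and the number of passes used. */
    public static final class Result {
        public final int[] labels;
        public final int iterations;

        Result(int[] labels, int iterations) {
            this.labels = labels;
            this.iterations = iterations;
        }
    }

    /** Squared Euclidean distance between two equal-length vectors. */
    static double squaredDistance(double[] a, double[] b) {
        double sum = 0.0;
        for (int d = 0; d < a.length; d++) {
            double diff = a[d] - b[d];
            sum += diff * diff;
        }
        return sum;
    }

    /** Standard K-means; stops when no label changes or maxIterations is hit. */
    public static Result cluster(double[][] points, int k, int maxIterations) {
        if (k < 1 || k > points.length) {
            throw new IllegalArgumentException("k must be between 1 and the number of points");
        }
        int n = points.length;
        int dim = points[0].length;

        // Assumption: seed the centroids with the first k points (deterministic).
        double[][] centroids = new double[k][];
        for (int c = 0; c < k; c++) {
            centroids[c] = points[c].clone();
        }

        int[] labels = new int[n];
        Arrays.fill(labels, -1);
        int iterations = 0;
        boolean changed = true;

        while (changed && iterations < maxIterations) {
            changed = false;
            iterations++;

            // Assignment step: attach each point to its nearest centroid.
            for (int i = 0; i < n; i++) {
                int best = 0;
                for (int c = 1; c < k; c++) {
                    if (squaredDistance(points[i], centroids[c])
                            < squaredDistance(points[i], centroids[best])) {
                        best = c;
                    }
                }
                if (labels[i] != best) {
                    labels[i] = best;
                    changed = true;
                }
            }

            // Update step: move each centroid to the mean of its members.
            double[][] sums = new double[k][dim];
            int[] counts = new int[k];
            for (int i = 0; i < n; i++) {
                counts[labels[i]]++;
                for (int d = 0; d < dim; d++) {
                    sums[labels[i]][d] += points[i][d];
                }
            }
            for (int c = 0; c < k; c++) {
                if (counts[c] > 0) { // an empty cluster keeps its old centroid
                    for (int d = 0; d < dim; d++) {
                        centroids[c][d] = sums[c][d] / counts[c];
                    }
                }
            }
        }
        return new Result(labels, iterations);
    }

    public static void main(String[] args) {
        // Hypothetical toy data: two well-separated groups, interleaved.
        double[][] points = {
            {1.0, 1.0}, {8.0, 8.0}, {1.2, 0.8},
            {8.2, 7.9}, {0.9, 1.1}, {7.8, 8.1}
        };
        Result r = cluster(points, 2, 100);
        System.out.println("labels = " + Arrays.toString(r.labels));
        System.out.println("iterations = " + r.iterations);
    }
}
```

The `iterations` field is returned alongside the labels because the abstract compares the two algorithms on both accuracy and iteration count; a comparison harness along these lines would record the same two quantities for each value of K.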

