Abstract

Indonesian government agencies under the Ministry of Energy and Mineral Resources have problems in classifying data dictionary of coal. This research conduct grouping coal dictionary using K-Means and MeanShift algorithm. K-means algorithm is used to get cluster value on character and word criteria. The last iteration of Euclidian distance calculation data on k-means combine with Meanshift algorithm. The meanshift calculates centroid by selecting different bandwidths. The result of grouping using k-means and meanshift algorithm shows different centroid to find optimum bandwidth value. The data dictionary of this research has sorted in alphabetically.

Highlights

  • Data Mining is a process of extracting data or filtering data by utilising a collection of a big data

  • Puslitbang tekMIRA has a dictionary of coal

  • K-Means Clustering is a grouping of data, where the data in K Means Clustering K is the amount of data or the number of constants

Read more

Summary

Introduction

Data Mining is a process of extracting data or filtering data by utilising a collection of a big data. It takes time between three to five minutes to find a term inside a coal dictionary Based on these problems, data mining method is used to classify data dictionary. This research conduct grouping data dictionary of a coal term using data mining method. In this research the algorithm grouping a coal term in a dictionary based on character and word in a cluster. The result data of clustering using K-Means and Meanshift algorithm is shown using the matloplib plot. Means is the average value of the data set as Cluster [7]. K-Means is an algorithm used to generate k clusters from a collection of data sets in a simple way [14]. The cluster center is the average of all data/objects in a particular cluster. Bandwidth is a free parameter that shows the effect on the estimated density generated

Results and Analysis
TEL KOM NIK A
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call