Comparative Performance Evaluation Results of Classification Algorithm in Data Mining to Identify Types of Glass Based on Refractive Index and It’s Elements

Rinto Suppa

doi:10.51557/pt_jiit.v8i1.1705

Abstract

Data science is becoming familiar to the public and companies in the era of Industrial Revolution 4.0. One part of data science is data mining. Data mining is the process of collecting information to see patterns from very large datasets and data discovery which is processed in such a way as to become knowledge based on the interpretation of the information obtained. The purpose of this paper is to compare the performance evaluation results of several classification algorithms in data mining (such as DT C-45, Neural Network, KNN, LDA, Naïve Bayes, SVM, and Rule Induction) for identifying types of glass based on the refractive index and its elements. The data set used is a glass identification dataset taken from the UCI Machine Learning Repository. The results of the evaluation can be seen from the criteria like of Accuracy and Kappa using 10 fold cross validation. As a result, the K-Nearest Neighbors (KNN) algorithm has the best Accuracy and Kappa values, namely 72.90% for Accuracy and 0.632 for Kappa values. To determine the significance of the accuracy value, the T-Test method is used.

Full Text