Optimization of Classification Algorithms Performance with k-Fold Cross Validation

Moch Anjas Aprihartha,Idham Idham

doi:10.29303/emj.v7i2.212

Moch Anjas Aprihartha, Idham Idham

Open Access

https://doi.org/10.29303/emj.v7i2.212

Copy DOI

Export

Save

Cite

Journal: EIGEN MATHEMATICS JOURNAL	Publication Date: Sep 20, 2024
License type: CC BY-NC-SA 4.0

Abstract
Full-Text
Similar Papers

Abstract

Listen

Supervised learning is a predictive method used to make predictions or classifications. Supervised learning algorithms work by building a model using training data that includes both independent and dependent variables. Several methods for building classification include Logistic Regression, Naive Bayes, K-Nearest Neighbor (KNN), decision tree, etc. The lack of capacity of a classification algorithm to generalize certain data can be associated with the problem of overfitting or underfitting. K-fold cross-validation is a method that can help avoid overfitting or underfitting and produce a algorithm with good performance on new data. This study will test the Naive Bayes, K-Nearest Neighbor (KNN), Classification and Regression Tree (CART), and Logistic Regression methods with k-fold cross-validation on two different datasets. The values of k set for cross-validation are 2, 3, 5, 7, and 10. The analysis results concluded that each classification algorithm performed best at 10-fold cross-validation. In DATA 1, the Naive Bayes algorithm has the highest average accuracy of 0.67 (67%) and the error rate is 0.33 (33%), followed by the CART algorithm, KNN, and finally logistic regression. While DATA 2, the KNN algorithm has the highest average accuracy of 0.66 (66%) and an error rate of 0.34 (34%), followed by the CART algorithm, Naive Bayes, and finally logistic regressionbut can be a reference if you want to predict the growth direction of the accommodation and food service activities sector.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

Optimization of Classification Algorithms Performance with k-Fold Cross Validation

Abstract

Published Version

Talk to us

Similar Papers

More From: EIGEN MATHEMATICS JOURNAL

Lead the way for us

Similar Papers

Performance Comparison of CART And KNN Algorithms for Analyzing Early Predictions of Mental Health
Eling Sekar Anggraeni ... Pungkas Subarkah
Journal of Computer Networks, Architecture and High Performance Computing | VOL. 6
Eling Sekar Anggraeni, et. al.Eling Sekar Anggraeni ... Pungkas Subarkah
22 Jul 2024
Journal of Computer Networks, Architecture and High Performance Computing | VOL. 6

Design of an international trade target supplier evaluation system based on CART algorithm
Rui Liu
Intelligent Decision Technologies | VOL. 18
Rui LiuRui Liu
16 Sep 2024
Intelligent Decision Technologies | VOL. 18

Performance Analysis of Classification and Regression Tree (CART) Algorithm in Classifying Male Fertility Levels with Mobile-Based
Arif Rahman Hakim ... Dewi Marini Umi Atmaja
Tech-E | VOL. 7
Arif Rahman Hakim, et. al. Arif Rahman Hakim ... Dewi Marini Umi Atmaja
28 Aug 2023
Tech-E | VOL. 7

Prediction Model for Intention of Mental Health Promotion Program among Korean Elderly Persons Using CART Algorithm
Haewon Byeon ... Sung-Hyoun Cho
Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology | VOL. 5
Haewon Byeon, et. al.Haewon Byeon ... Sung-Hyoun Cho
31 Oct 2015
Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology | VOL. 5

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Optimization of Classification Algorithms Performance with k-Fold Cross Validation

Abstract

Published Version

Talk to us

Similar Papers

More From: EIGEN MATHEMATICS JOURNAL