Identification of Dry Bean Varieties Based on Multiple Attributes Using CatBoost Machine Learning Algorithm

S Krishnan,Ellappan Venugopal,Karthick Kanagarathinam,S K Aruna

doi:10.1155/2023/2556066

S Krishnan, Ellappan Venugopal + Show 2 more

Open Access

https://doi.org/10.1155/2023/2556066

Copy DOI

Abstract

Dry beans are the most widely grown edible legume crop worldwide, with high genetic diversity. Crop production is strongly influenced by seed quality. So, seed classification is important for both marketing and production because it helps build sustainable farming systems. The major contribution of this research is to develop a multiclass classification model using machine learning (ML) algorithms to classify the seven varieties of dry beans. The balanced dataset was created using the random undersampling method to avoid classification bias of ML algorithms towards the majority group caused by the unbalanced multiclass dataset. The dataset from the UCI ML repository is utilised for developing the multiclass classification model, and the dataset includes the features of seven distinct varieties of dried beans. To address the skewness of the dataset, a Box-Cox transformation (BCT) was performed on the dataset’s attributes. The 22 ML classification algorithms have been applied to the balanced and preprocessed dataset to identify the best ML algorithm. The ML algorithm results have been validated with a 10-fold cross-validation approach, and during validation, the CatBoost ML algorithm achieved the highest overall mean accuracy of 93.8 percent, with a range of 92.05 percent to 95.35 percent.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Scientific Programming	Publication Date: Apr 21, 2023
Citations: 5	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Identification of Dry Bean Varieties Based on Multiple Attributes Using CatBoost Machine Learning Algorithm

Abstract

Talk to us

Similar Papers

More From: Scientific Programming

Lead the way for us

Similar Papers

Application of classification machine learning algorithms for characterizing nutrient transport in a clay plain agricultural watershed
Ahmed Elsayed ... Pradeep Goel
Journal of Environmental Management | VOL. 345
Ahmed Elsayed, et. al.Ahmed Elsayed ... Pradeep Goel
06 Sep 2023
Journal of Environmental Management | VOL. 345

Enhancement of text categorization results via an ensemble learning technique
Wasf A Taha ... Suhad A Yousif
-
Wasf A Taha, et. al.Wasf A Taha ... Suhad A Yousif
01 Jan 2023
01 Jan 2023

Practical Implications of Dequantization on Machine Learning Algorithms: A Survey
Vinooth Rao Kulkarni ... Shuai Xu
-
Vinooth Rao Kulkarni, et. al.Vinooth Rao Kulkarni ... Shuai Xu
01 Jan 2023
01 Jan 2023

Transfer-Ensemble Learning: A Novel Approach for Mapping Urban Land Use/Cover of the Indian Metropolitans
Prosenjit Barman ... Sudhir Kumar Singh
Sustainability | VOL. 15
Prosenjit Barman, et. al.Prosenjit Barman ... Sudhir Kumar Singh
06 Dec 2023
Sustainability | VOL. 15

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Identification of Dry Bean Varieties Based on Multiple Attributes Using CatBoost Machine Learning Algorithm

Abstract

Talk to us

Similar Papers

More From: Scientific Programming