Abstract

This study aims to empirically analyze teaching-learning-based optimization (TLBO) and machine learning algorithms using k-means and fuzzy c-means (FCM) algorithms for their individual performance evaluation in terms of clustering and classification. In the first phase, the clustering (k-means and FCM) algorithms were employed independently and the clustering accuracy was evaluated using different computational measures. During the second phase, the non-clustered data obtained from the first phase were preprocessed with TLBO. TLBO was performed using k-means (TLBO-KM) and FCM (TLBO-FCM) (TLBO-KM/FCM) algorithms. The objective function was determined by considering both minimization and maximization criteria. Non-clustered data obtained from the first phase were further utilized and fed as input for threshold optimization. Five benchmark datasets were considered from the University of California, Irvine (UCI) Machine Learning Repository for comparative study and experimentation. These are breast cancer Wisconsin (BCW), Pima Indians Diabetes, Heart-Statlog, Hepatitis, and Cleveland Heart Disease datasets. The combined average accuracy obtained collectively is approximately 99.4% in case of TLBO-KM and 98.6% in case of TLBO-FCM. This approach is also capable of finding the dominating attributes. The findings indicate that TLBO-KM/FCM, considering different computational measures, perform well on the non-clustered data where k-means and FCM, if employed independently, fail to provide significant results. Evaluating different feature sets, the TLBO-KM/FCM and SVM(GS) clearly outperformed all other classifiers in terms of sensitivity, specificity and accuracy. TLBO-KM/FCM attained the highest average sensitivity (98.7%), highest average specificity (98.4%) and highest average accuracy (99.4%) for 10-fold cross validation with different test data.

Highlights

  • Data mining and machine learning algorithms are efficient in pattern identification, extraction and data separation through clustering and classification [1]

  • This section discusses the outcome of teaching-learning-based optimization (TLBO)-KM/fuzzy c-means (FCM) and machine learning algorithms in different cases

  • (8) The combined average accuracy obtained collectively is approximately 99.4% in case of TLBO was performed using k-means (TLBO-KM) and 98.6% in case of TLBO-FCM

Read more

Summary

Introduction

Data mining and machine learning algorithms are efficient in pattern identification, extraction and data separation through clustering and classification [1]. Data mining and machine learning techniques can identify these hidden patterns and are effective [7] Another aspect is the appropriate association and correlation between methods and their tuning parameters, threshold ranges and attribute dominance factors. The results may suffer in the case of FCM if the dataset is large, and if there is uncertainty in the data objects and optimal parameters setting [9] In these scenarios, the classification approach can be helpful in determining the selection points and preparing uniform data for experimentation. Rao et al proposed the teaching-learning-based optimization (TLBO) algorithm, which only requires common controlling parameters such as population size and number of generations but does not require any algorithm-specific parameters [11] This solves the problem of unsuitable tuning of the parameters. The objective of our study was to analyze the performance of the TLBO with k-means (TLBO-KM) and FCM (TLBOFCM) (TLBO-KM/FCM) algorithms along with machine learning algorithms considering variable parameters and computational aspects

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.