Abstract

Due to challenges posed by mixed data clustering, this study aims to introduce an innovative clustering-based classification algorithm that possesses the advantages of both classification and clustering techniques for mixed data analysis. The proposed algorithm employs the K-prototypes algorithm with a genetic algorithm to optimize weights and centroids and utilizes the bagging method to build multiple classifiers, thereby enhancing classification performance. Furthermore, it incorporates four mutation mechanisms, including Gaussian, Cauchy, Levy, and single-point mutations, to explore optimal solutions. This study suggests using a 20% sampling ratio for the bootstrap sampling in the proposed algorithm. This ratio has been proven to be sufficient for achieving good classification performance while reducing computational time. Experimental results indicate that the proposed algorithm outperforms benchmark classifiers, demonstrating superior classification performance across five performance indicators. In addition, the loan eligibility case study offers valuable insights into applying the proposed algorithm in real-world scenarios, demonstrating that the proposed algorithm can achieve superior classification performance compared to other algorithms. It also offers managerial implications to help different industries and fields understand the appropriate timing and scenarios for implementing the algorithm.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call