Abstract

Customer churn prediction using data mining is an increasingly important concern in the highly saturated telecommunication sector. Albeit its popularity in research, few studies investigated the use of heterogeneous ensembles for this purpose. Therefore, this study evaluated and compared the performance of five grid-searched optimized base classifiers (logistic regression, decision trees, K-nearest neighbors, multilayer perceptron, and support vector machines) and their heterogeneous ensembles (stacking, grading, majority voting, weighted majority voting, and soft voting) on four different telecom datasets. The results indicate that there are significant improvements when using heterogeneous ensembles compared to single classifiers, with stacking being the most performant ensemble. The best meta-classifier for stacking was found to be a multilayer perceptron. Additionally, we identified that using probabilities as an input to the ensemble's meta-classifier, such as in soft voting and stacking variants, can increase their performance.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.