Abstract
Customer retention is a challenging and critical issue in telecommunication and service-based sectors. Various researchers have established the need for a service-based company to retain their existing customers much cheaper than acquiring new ones. However, the predictive models for observing customers’ behavior is one of the great instruments in the customer retention process and inferring the future behavior of the customers. Selecting the right and best model is another herculean task because the performances of predictive models are greatly affected when the real-world dataset is highly imbalanced. The study analyses the performance of homogeneous ensembles; bagging, boosting, rotation forest, cascade, and dagging. These ensembles were applied to both raw and balanced datasets to compare the performance of the models. The data sampling method (oversampling) was adopted to balance the raw dataset. The primary metric used for the evaluation of the performance of the models was Accuracy and ROC/AUC (Receiver Operating Characteristics/Area Under Curve). Weka 3.8.5 machine learning tool used to analyze and develop the models. The study reveals that Bagging had the best performance having an AUC of 0.987, followed by boosting and Rotation Forest both with an AUC of 0.985.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: International Journal of Research and Innovation in Applied Science
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.