Abstract

Abstract The telecommunications industry is representative when it comes to a country’s economy. In this industry, the customer plays a very important role in maintaining a stable income. The churn customer is one of the most important concerns for large companies. This increased attention is due to its direct effect on the revenues of large companies in the telecommunications industry, companies being in a constant search to develop ways to predict this type of customer. The aim of our paper is to identify potential customers at risk of churn using modern data mining techniques, often used in the business world. From the nine techniques tested, we choose as the churn prediction model, the technique with the highest performance. The effectiveness of the model is tested and evaluated by the f1-score. The model developed in the paper uses machine learning techniques on the Python platform, exploring a wide range of algorithms from logistic regression and the method of balancing the analyzed data set (Balanced Random Forest) to supervised learning methods (K-Nearest Neighbors, Naive Bayes) and optimization packages (Ligh GBM, CATBoost, ADABoost, RUSBoost, Stochastic Gradient Descent). The techniques analyzed in this paper cover a diverse range of methods that are compared in terms of performance. RUSBoost proves to be the best churn prediction model for telecom customers in this study. RUSBoost has the lowest loss function of all the tested techniques.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.