Abstract

Cross-Company Churn Prediction (CCCP) is a domain of research where one company (target) is lacking enough data and can use data from another company (source) to predict customer churn successfully. To support CCCP, the cross-company data is usually transformed to a set of similar normal distribution of target company data prior to building a CCCP model. However, it is still unclear which data transformation method is most effective in CCCP. Also, the impact of data transformation methods on CCCP model performance using different classifiers have not been comprehensively explored in the telecommunication sector. In this study, we devised a model for CCCP using data transformation methods (i.e., log, z-score, rank and box-cox) and presented not only an extensive comparison to validate the impact of these transformation methods in CCCP, but also evaluated the performance of underlying baseline classifiers (i.e., Naive Bayes (NB), K-Nearest Neighbour (KNN), Gradient Boosted Tree (GBT), Single Rule Induction (SRI) and Deep learner Neural net (DP)) for customer churn prediction in telecommunication sector using the above mentioned data transformation methods. We performed experiments on publicly available datasets related to the telecommunication sector. The results demonstrated that most of the data transformation methods (e.g., log, rank, and box-cox) improve the performance of CCCP significantly. However, the Z-Score data transformation method could not achieve better results as compared to the rest of the data transformation methods in this study. Moreover, it is also investigated that the CCCP model based on NB outperform on transformed data and DP, KNN and GBT performed on the average, while SRI classifier did not show significant results in term of the commonly used evaluation measures (i.e., probability of detection, probability of false alarm, area under the curve and g-mean).

Highlights

  • In the last decade, the companies are moving towards the new technological paradigms in order to progressively eliminate the barriers in information transformation (Ferreira, Moreira, & Seruca, 2017; Henriette, Feki, & Boughzala, 2015)

  • We study the following research questions: RQ1: What is the effect of data transformation (DT) methods (i.e., Log, Rank, Box-Cox, and Z-Score) on data normality in Cross-Company Churn Prediction (CCCP)?

  • In order to report RQ1 (What is the effect of DT methods (i.e., Log, Rank, Box-Cox, and Z-Score) on data normality in CCCP?), we applied Log, Rank, Box-Cox and Z-Score DT methods on the subject datasets

Read more

Summary

Introduction

The companies are moving towards the new technological paradigms in order to progressively eliminate the barriers in information transformation (Ferreira, Moreira, & Seruca, 2017; Henriette, Feki, & Boughzala, 2015). This technological shift allowed the companies for integration of technological innovations such as So-cial technologies, Mobile connectivity, Cloud computing and Big Data analytics (Uhl & Gollenia, 2016). Since all the data related to the business customers are usually stored and organized in Customer Relationship Management (CRM) (Bull, 2010; Reychav & Weisberg, 2009)

Methods
Results
Discussion
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.