Abstract

Attribute selection has a significant effect on the performance of the machine learning studies by selecting the attributes having significant effect on result, reducing the number of attributes, and reducing the calculation cost. In this study, a new attribute selection method which is a combination of the R-correlation coefficient-based attribute selection (RCBAS) and the ρ-correlation coefficient-based attribute selection (ρCBAS) called the Two-Stage Correlation-Based Attribute Selection (TSCBAS) is proposed to select significant attributes. The proposed attribute selection method has been applied to customer churn prediction on a telecommunications dataset for performance evaluation. The dataset used in the study includes real customer call records details for the years 2013 and 2014 obtained from a major telecommunications company in Turkey. Apart from the proposed attribute selection method, four different methods named Rcorrelation coefficient-based attribute selection, ρ-correlation coefficient-based attribute selection, ReliefF, and Gain Ratio have been used for creating five datasets. After that, four classifier algorithms including Random Forest, C4.5 Decision Tree, Naive Bayes and AdaBoost.M1 have been applied. The obtained results have been compared according to the performance metrics comprising Accuracy (ACC), Sensitivity (TPR), Specificity (SPC), F-measure (F), AUC (area under the ROC curve), and run-time. The results of the comparisons show that the proposed attribute selection algorithm outperforms the state of the art methods on customer churn prediction.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.