Abstract

With increasing competition in the business world, many companies use data mining  techniques to determine the level of customer loyalty. The customer data used in this  study is the german credit dataset obtained from UCI. Such data have an imbalance  problem of class because the amount of data in the loyal class is more than in the  churn class. In addition, there are some irrelevant attributes for customer  classification, so attributes selection is needed to get more accurate classification  results. One classification algorithm is naive bayes. Naive Bayes has been used as an  effective classification for years because it is easy to build and give an independent  attribute into its structure. The purpose of this study is to improve the accuracy of the  Naive Bayes for customer classification. SMOTE and genetic algorithm do for  improving the accuracy. The SMOTE is used to handle class imbalance problems,  while the genetic algorithm is used for attributes selection. Accuracy using the Naive  Bayes is 47.10%, while the mean accuracy results obtained from the Naive Bayes  with the application of the SMOTE is 78.15% and the accuracy obtained from the  Naive Bayes with the application of the SMOTE and genetic algorithm is 78.46%.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.