Abstract

There are a lot of imbalanced data in many fields such as finance, information security and industrial system. How to extract valuable information from imbalanced data is a research hotspot and focus. In this paper, the imbalanced consumption data of credit card in UCI database are used. Oversampling SMOTE method, AdaBoost algorithm and cost-sensitive algorithm are used to process the imbalanced data. Empirical results show that SMOTE-AdaBoost method is better than traditional AdaBoost method, and the cost-sensitive algorithm that increases the weight of the minority class samples is also higher than that of traditional AdaBoost method. Finally, this paper describes the challenges and future research directions of imbalanced data classification.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call