Abstract

Class-imbalanced classification has become a crucial problem in machine learning. Under-sampling is a widely adopted technique for addressing it, and it mainly relies on resampling the majority class either randomly or heuristically. These sample-based under-sampling methods ignore part of the majority class information during training. In this paper, we propose a clustering-based prototype generation technique that generates representative instances of both the majority and minority classes at a relatively balanced ratio. This reduces the imbalance ratio and the overlap of boundary samples, which facilitates classification. We evaluate the algorithm on 8 imbalanced datasets and show that the proposed method outperforms three other under-sampling approaches.
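The general idea of clustering-based prototype generation can be illustrated with a minimal sketch. The abstract does not name the clustering algorithm or how the number of prototypes per class is chosen, so the code below assumes plain k-means and caller-supplied prototype counts. The function names `kmeans_prototypes` and `prototype_balance` and the parameters `k_major` and `k_minor` are hypothetical and are not taken from the paper.

```python
import random


def _sq_dist(a, b):
    """Squared Euclidean distance between two equal-length points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def _mean(points):
    """Component-wise mean of a non-empty list of points."""
    n = len(points)
    return tuple(sum(col) / n for col in zip(*points))


def kmeans_prototypes(points, k, iters=50, seed=0):
    """Return min(k, len(points)) centroids found by Lloyd's k-means.

    Assumption: k-means stands in for whatever clustering the paper uses.
    """
    rng = random.Random(seed)
    k = min(k, len(points))
    # Initialise centroids from k distinct positions in the data.
    centroids = [tuple(p) for p in rng.sample(points, k)]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: _sq_dist(p, centroids[i]))
            clusters[nearest].append(p)
        # An empty cluster keeps its previous centroid.
        updated = [_mean(c) if c else centroids[i]
                   for i, c in enumerate(clusters)]
        if updated == centroids:
            break
        centroids = updated
    return centroids


def prototype_balance(majority, minority, k_major, k_minor):
    """Replace each class with cluster centroids (prototypes).

    Returns (X, y), where label 0 marks majority prototypes and
    label 1 marks minority prototypes.
    """
    major_protos = kmeans_prototypes(majority, k_major)
    minor_protos = kmeans_prototypes(minority, k_minor)
    X = major_protos + minor_protos
    y = [0] * len(major_protos) + [1] * len(minor_protos)
    return X, y


if __name__ == "__main__":
    rng = random.Random(42)
    # Toy data with a 9:1 imbalance (synthetic, for illustration only).
    majority = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(90)]
    minority = [(rng.gauss(3, 1), rng.gauss(3, 1)) for _ in range(10)]
    X, y = prototype_balance(majority, minority, k_major=5, k_minor=5)
    print(len(X), y.count(0), y.count(1))
```

Because every majority sample contributes to some centroid, the prototypes summarise the whole majority class rather than a sampled subset of it, which is the contrast with sample-based under-sampling that the abstract draws. Choosing similar values for `k_major` and `k_minor` is one simple way to obtain a roughly balanced training set under these assumptions.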
