Abstract

Oversampling-based methods achieve impressive performance for classifying imbalanced data. However, many existing oversampling algorithms are still very sensitive to noise. Besides, these algorithms also require one or more parameters, how to set these parameters is very challenging. Furthermore, the generated samples by these algorithms are meaningless or unsafe. To solve these problems, a novel oversampling algorithm for unbalanced classification is presented, named Natural Local Density-based Adaptive Oversampling algorithm (NLDAO). NLDAO has four main advantages: (a) it does not need any parameter due to the use of natural neighbors to calculate local density; (b) it applies a noisy filter to remove noises, which makes the sample boundary cleaner so that the proposed method is robust to noise; (c) it unevenly distributes the generated samples to the minority class, which maintains the original characteristics and enhances the boundary classification ability; (d) it determines the generated region by tracking the sample proportion drop that declines the randomness of the synthesized and reduces the noisy generation. Finally, in experiments, we apply the NLDAO algorithm to artificial datasets to visually demonstrate its effectiveness. Moreover, intensive experiments on real datasets show that NLDAO can achieve better performance than state-of-the-art methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.