Abstract

Positive and unlabeled (PU) learning is to learn a binary classifier with good generalization ability from PU data. A variety of PU learning algorithms with promising performance have been proposed. However, most of them assume that PU samples are “clean”, which is not true in real applications due to the existing noisy or redundant samples. Thus, how to obtain a robust PU classifier with better performance is a challenging problem. To this end, we propose a novel multi-objective evolutionary algorithm to tackle it, named BPUSS-MOEA. Specifically, we firstly transform the robust PU learning into a bi-objective PU sample selection (BPUSS) problem, in which two objectives are designed. One is the number of selected “clean” PU samples and the other is the PU accuracy. Then, a dual-coding scheme is designed to represent the selected “clean” PU samples and the labels of U samples. With the dual-coding scheme, a novel offspring generation strategy is developed to achieve the offsprings with high quality. To further improve the performance of BPUSS-MOEA, an effective population initialization strategy is designed. Experiments on 10 datasets with different noise levels show that compared with the state-of-the-arts, the proposed algorithm demonstrates its robustness in terms of the PU accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.