Abstract

Class imbalance hinders the performance of some standard classifiers. However, class imbalance may not be solely responsible for the decrease in performance. Research efforts show that imbalanced datasets suffer from overlapping problems and borderline samples, which often deteriorate the classification performance. Conventional imbalanced learning methods mainly focus on balancing the distribution between classes but ignore difficulties caused by the above problems, thus underperforming drastically. This paper proposes a hybrid network called SemiPro-Empha to alleviate the aforementioned problems by learning a feature space with good inter-class separability and intra-class compactness. SemiPro-Empha comprises two modules: a feature learning loss called Semi-Prototype contrastive loss (Semi-Proto), guiding the feature extractor to learn a feature space where the projections of original overlapping classes can be separated, thereby improving classification performance. Additionally, this paper also presents a robust valuable borderline sample mining strategy called Emphasizing (Empha). Emphasizing identifies “valuable” borderline samples and eliminates noisy samples to create an auxiliary training dataset during each training epoch, providing up-to-date global classification boundary information for the training model while ensuring its robustness. Extensive experiments conducted on the breast cancer dataset and seven imbalanced datasets demonstrate the effectiveness of SemiPro-Empha.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.