Abstract

Existing approaches for handling imbalanced problem are based on the discriminant approaches, while only little attention is dedicated to mining the probability information provided by generative approaches. Moreover, the multi-view learning trains classifier through combining different representations of data for improving the performance of classifier in imbalanced classification. In this paper, a learning framework consisting of fisher kernel and Bi-Bagging is proposed for imbalanced problem. The Fisher kernel is employed to integrate the probability information into the pristine feature of data. Thus, the generated fisher vector contain better discriminatory information. However, the generated fisher vector may lead to high-dimension overfitting. So the dataset represented by the fisher vector is then processed by Bi-Bagging to generate multi-view data and balanced training subsets, which not only reduces the high dimension of generated fisher vector but also promotes the accuracy of minority instances. In one word, the combination of fisher kernel and Bi-Bagging makes use of the probability information in the pristine feature and generates balanced multi-view training subsets with adequate dimension. Therefore, the proposed learning framework is independent of specific models, and the base classifier of the learning framework can be replaced by different linear classifier. Two experimental strategies are implemented to validate the effectiveness of the proposed learning framework for imbalanced datasets on 30 KEEL datasets.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.