Abstract

The extended belief-rule-based (EBRB) system has become a widely recognized and effective rule-based system in decision-making. The system uses a data-driven method to generate the rule base by transforming each training sample into a rule. Hence, when an EBRB system is applied in an imbalanced classification dataset, the imbalance of training dataset will retain in the generated rule base. More specifically, the number of rules transformed from majority classes will be far greater than the rules transformed from minority classes. This issue usually leads to a sharp decrease in the accuracies of minority classes. This study analyses how the imbalance of training dataset exists in the generated EBRB and then proposes a Balance Adjusting (BA) approach to eliminate the influence of imbalance in the rule base. The BA approach adjusts rule activation weights of all activated rules, and further enhances the competitiveness of rules with higher activation weight during the rule aggregation process of the EBRB system. Several case studies in imbalanced benchmark classification datasets from UCI demonstrate how the use of the BA approach improves the performance of the EBRB system. This study also conducts a series of experiments to validate the improvement of the proposed approach compared with some conventional and recent existing works. The comparison results illustrate that the BA approach is feasible, effective and robust, and it performs well especially in large scale datasets. Moreover, the BA approach can also combine with various rule activation weight calculation methods, which means it might worth to be applied as a generic process before the rule aggregation process of the EBRB system.

Highlights

  • Data imbalance is a common problem occurs in classification

  • The Balance Adjusting (BA) approach improves the performance of extended belief-rule-based (EBRB) system applied in imbalanced classification problems, and proves that the system itself is a powerful tool for classification but limited by the imbalance in its rule base

  • In this study, the BA approach is proposed to improve the performance of EBRB systems applied in imbalanced classification datasets

Read more

Summary

INTRODUCTION

Data imbalance is a common problem occurs in classification. An imbalanced dataset consists of majority classes and minority classes. W. Fang et al.: Balance Adjusting Approach of EBRB System for Imbalanced Classification Problem and it has been widely applied in many fields [2]–[4]. Fang et al.: Balance Adjusting Approach of EBRB System for Imbalanced Classification Problem and it has been widely applied in many fields [2]–[4] It can handle both quantitative and qualitative information, and is considered to be more interpretable than deep-learning-based tools. Since each training sample will be transformed into a rule by the data-driven method, the number of rules belong to each class will be the same as the number of training samples belong to each class, which means that the number of rules which belong to majority classes will be far greater than the number of rules which belong to majority classes Such kind of rule base is called imbalanced rule base in this study.

OVERVIEW OF BRB SYSTEM AND EBRB SYSTEM
OVERVIEW OF BRB SYSTEM
EXTENDED BELIEF RULE
THE DATA-DRIVEN CONSTRUCTION METHOD OF EBRB SYSTEM
THE EVIDENTIAL REASONING APPROACH OF EBRB SYSTEM
CASE STUDIES
PERFORMANCE OF EBRB SYSTEM OPTIMIZED BY THE BA APPROACH
EFFECTIVENESS VALIDATION OF THE PROPOSED APPROACH
COMPARE WITH CONVENTIONAL CLASSIFICATION APPROACHES
Findings
CONCLUSION
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.