Abstract

Attribute reduction is commonly referred to as the key topic in researching rough set. Concerning the strategies for searching reduct, though various heuristics based forward greedy searchings have been developed, most of them were designed for pursuing one and only one characteristic which is closely related to the performance of reduct. Nevertheless, it is frequently expected that a justifiable searching should explicitly involves three main characteristics: (1) the process of obtaining reduct with low time consumption; (2) generate reduct with high stability; (3) acquire reduct with competent classification ability. To fill such gap, a hybrid based searching mechanism is designed, which takes the above characteristics into account. Such a mechanism not only adopts multiple fitness functions to evaluate the candidate attributes, but also queries the distance between attributes for determining whether two or more attributes can be added into the reduct simultaneously. The former may be useful in deriving reduct with higher stability and competent classification ability, and the latter may contribute to the lower time consumption of deriving reduct. By comparing with 5 state-of-the-art algorithms for searching reduct, the experimental results over 20 UCI data sets demonstrate the effectiveness of our new mechanism. This study suggests a new trend of attribute reduction for achieving a balance among various characteristics.

Highlights

  • Attribute reduction [1,2], as one filter feature selection technique emerges in rough set [3,4,5], plays a crucial role in the field of data dimension reduction

  • Take “Ionosphere” data set and Akashata’s measure as an example, the stability of reducts which obtained by using Forward Greedy Searching (FGS), Attribute Group for Attribute Reduction (AGAR), Ensemble Selector for Attribute Reduction (ESAR), Dissimilarity for Attribute Reduction (DAR), Data-Guidance for Attribute Reduction (DGAR) and Hybrid Mechanism for Attribute Reduction (HMAR) are 0.2562, 0.1213, 0.6545, 0.2986, 0.2818 and 0.4416, respectively

  • Take “Forest Type Mapping” data set as an example, the elapsed time of obtaining reducts by using FGS, AGAR, ESAR, DAR, DGAR and HMAR, 0.0161, 0.0137, 0.0217, 0.0050, 0.0724 and 0.0175 s are required, respectively

Read more

Summary

Introduction

Attribute reduction [1,2], as one filter feature selection technique emerges in rough set [3,4,5], plays a crucial role in the field of data dimension reduction. If the form of the attribute reduction is fully defined, how to derive such qualified reduct is the key. Though the optimal reduct can be obtained through using exhaustion, the time consumption is frequently too high to be accepted because exhaustion is designed for finding all reducts. For such reason, the heuristics based searching [6,7]

Methods
Findings
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.