Abstract

The Rough Sets Theory is used in data mining with emphasis on the treatment of uncertain or vague information. In the case of classification, this theory implicitly calculates reducts of the full set of attributes, eliminating those that are redundant or meaningless. Such reducts may even serve as input to other classifiers other than Rough Sets. The typical high dimensionality of current databases precludes the use of greedy methods to find optimal or suboptimal reducts in the search space and requires the use of stochastic methods. In this context, the calculation of reducts is typically performed by a genetic algorithm, but other metaheuristics have been proposed with better performance. This work proposes the innovative use of two known metaheuristics for this calculation, the Variable Neighborhood Search, the Variable Neighborhood Descent, besides a third heuristic called Decrescent Cardinality Search. The last one is a new heuristic specifically proposed for reduct calculation. Considering some databases commonly found in the literature of the area, the reducts that have been obtained present lower cardinality, i.e., a lower number of attributes.

Highlights

  • Large amounts of data are generated everyday and the ability to analyze them is normally a challenge

  • Three alternative metaheuristics are evaluated for the calculation of reducts in Rough Sets Theory (RST): Variable Neighborhood Search (VNS), Variable Neighborhood Decent (VND) and Decrescent Cardinality Search (DCS), which are described in Sections 4.1, 4.2 and 4.3, respectively

  • Each test case is related to the use of a specific metaheuristic for the calculation of reducts applied to the classification of a specific database

Read more

Summary

Introduction

Large amounts of data are generated everyday and the ability to analyze them is normally a challenge. The present work evaluates alternative metaheuristics for reduct calculation in RST, being applied for the classification of the same databases. Two different functions were evaluated in this work for the calculation of reducts in RST: the dependency degree between attributes and the relative dependency [8] These functions, presented, allows simplifying the calculation of the reducts in comparison with the standard approach based on the discernibility matrix, which presents high computational complexity, not being feasible for large and complex datasets. Three alternative metaheuristics are evaluated for the calculation of reducts in RST: VNS, VND and DCS, which are described in Sections 4.1, 4.2 and 4.3, respectively. Each test case is related to the use of a specific metaheuristic for the calculation of reducts applied to the classification of a specific database

Rough Set Theory
Indiscernibility Relations
Reducts
Attribute Reduction in Rough Set-Theory
Discernibility Matrix-Based Approaches
Dependency Function-Based Approaches
Attribute Reduction Based on Relative Attribute Dependency
Metaheuristics Applied to the Calculation of Reducts in RST
Local Search Schemes
Results
Final Remarks
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.