Abstract

Fuzzy rough set theory can model uncertainty in data and has been applied to feature selection for machine learning tasks. The existence of noise in data is one of the reasons for data uncertainty. However, most classical fuzzy rough set models are often sensitive to the noise in data, which somewhat degrades their applicability to process uncertainty of data. Furthermore, a robust feature evaluation function is nontrivial in a fuzzy rough set model as non-optimal feature subsets may be selected due to the perturbations from redundant features. In this paper, we delve into local density and indispensable features for fuzzy rough feature selection to address these challenges. We first propose a Local Density-based Fuzzy Rough Set (LDFRS) model to tackle noisy data. Mutual information is then plugged into the proposed LDFRS model to evaluate uncertainty in data. A joint feature evaluation function on the indispensability and relevance of features is constructed to evaluate the significance of features. On this basis, a fuzzy rough feature selection algorithm is built upon the LDFRS model. Experimental results using four typical classifiers demonstrate the robustness and effectiveness of the proposed model including our feature selection algorithm and its superiority against baseline methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.