Abstract

High-dimensionality is the most noticeable characteristic of multilabel data. In practice, multilabel data typically contain complex noises. Ignoring these noises in the feature selection process tends to cause an inaccurate prediction. Besides, many existing multilabel feature selection methods assume that the relation between samples and relevant labels is crisp, and the difference information hiding in the label space is lost. Given all these, in this paper, soft labels are investigated by label enhancement in multilabel feature selection. A robust feature selection algorithm is proposed using label enhancement and multilabel β-precision fuzzy rough sets. With the perspective of data distribution from samples in the same and different class, the margin-based robust fuzzy neighborhood is first defined to construct the robust fuzzy granular space. Second, label enhancement strategy is given in the robust fuzzy granular space considering multilabel data distribution. To investigate the noise-tolerant model, the underlying structure of label space after label enhancement is employed to encode the score vector of samples, which is used to search the pseudo-different class's samples of target sample. Then, the multilabel β-precision fuzzy rough set model is built to deal with multilabel data. Moreover, the fuzzy approximation degree of knowledge and the fuzzy dependency of decision classes with respect to conditional features are fused to measure the significance of features. Finally, a robust heuristic multilabel feature selection algorithm is proposed. Extensive experiments on classification performance and anti-noise ability are conducted, which verify that the proposed algorithm is effective and robust.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.