Neighborhood Rough Set Model Research Articles

The advent of the era of big data is accompanied by the generation of large-scale data of various types. Extracting the potential value and rules from such data has always been a challenge. Due to various external and internal factors, it is commonplace for large-scale data to exhibit the phenomenon of missing limited labels. In addressing a large-scale mixed information system with limited label missing (LSMDISLML), local neighborhood rough set model (LNRS-model) is typically employed. However, the identical neighborhood radius is often used by such model when confronted with numerical attributes, which could potentially attenuate the classification capability of the data. Local fuzzy rough set model (LFRS-model) can overcomes this point. This paper studies local fuzzy rough attribute reduction for large-scale mixed data with limited missing labels based on LFRS-model via local fuzzy self information and overlap degree function. First, leveraging the statistical distribution of data as a foundation, fuzzy relations on the entire sample set are established, which has the advantage of being able to use different fuzzy similarity radii to calculate similarity, thereby adapting to different data distributions. Subsequently, the samples with missing labels are discarded as they constitute a small proportion of the entire sample set and have little impact on overall performance of dataset. The limited computing resources and storage space are focused on the sample set with complete labels (denoted as target set). Thereafter, based on the target set, local fuzzy λ-upper and lower approximations are defined, and LFRS-model is constructed. This model not only reduces processing time and sources of error in large-scale data but also improves data quality and enhances the reliability of the experimental results. Then, local fuzzy λ-self information is introduced and used to design a local fuzzy rough attribute reduction algorithm in a LSMDISLML. Furthermore, a overlap degree function is introduced to evaluate and reorder the attributes based on their importance, prioritizing the elimination of redundant attributes with high overlap and low importance from the preordered attribute set. This strategy effectively improves the efficiency of obtaining the optimal subset. Finally, a series of experiments are carried out. The experiment results demonstrate that the designed algorithm exhibits excellent performance in classification tasks and outlier detection tasks, surpassing existing four algorithms.

Read full abstract

Extracting knowledge from hybrid data, comprising both categorical and numerical data, poses significant challenges due to the inherent difficulty in preserving information and practical meanings during the conversion process. To address this challenge, hybrid data processing methods, combining complementary rough sets, have emerged as a promising approach for handling uncertainty. However, selecting an appropriate model and effectively utilizing it in data mining requires a thorough qualitative and quantitative comparison of existing hybrid data processing models. This research aims to contribute to the analysis of hybrid data processing models based on neighborhood rough sets by investigating the inherent relationships among these models. We propose a generic neighborhood rough set-based hybrid model specifically designed for processing hybrid data, thereby enhancing the efficacy of the data mining process without resorting to discretization and avoiding information loss or practical meaning degradation in datasets. The proposed scheme dynamically adapts the threshold value for the neighborhood approximation space according to the characteristics of the given datasets, ensuring optimal performance without sacrificing accuracy. To evaluate the effectiveness of the proposed scheme, we develop a testbed tailored for Parkinson’s patients, a domain where hybrid data processing is particularly relevant. The experimental results demonstrate that the proposed scheme consistently outperforms existing schemes in adaptively handling both numerical and categorical data, achieving an impressive accuracy of 95% on the Parkinson’s dataset. Overall, this research contributes to advancing hybrid data processing techniques by providing a robust and adaptive solution that addresses the challenges associated with handling hybrid data, particularly in the context of Parkinson’s disease analysis.

Read full abstract

Neighborhood Rough Set Model Research Articles

Related Topics

Articles published on Neighborhood Rough Set Model

Local fuzzy rough attribute reduction for large-scale mixed data with limited missing labels based on local fuzzy self information

Neighborhood margin rough set: Self-tuning neighborhood threshold

Feature selection based on neighborhood complementary entropy for heterogeneous data

A novel adaptive neighborhood rough sets based on sparrow search algorithm and feature selection

Multi-label feature selection using self-information in divergence-based fuzzy neighborhood rough sets

Fusing multiple interval-valued fuzzy monotonic decision trees

Attribute reduction for heterogeneous data based on monotonic relative neighborhood granularity

Distance metric learning-based multi-granularity neighborhood rough sets for attribute reduction

Adaptive neighborhood rough set model for hybrid data processing: a case study on Parkinson’s disease behavioral analysis

Dynamic multi-label feature selection algorithm based on label importance and label correlation

A local rough set method for feature selection by variable precision composite measure

Heterogeneous Feature Selection Based on Neighborhood Combination Entropy.

Feature Selection for Unbalanced Distribution Hybrid Data Based on ${k}$-Nearest Neighborhood Rough Set

Information fusion and attribute reduction for multi-source incomplete mixed data via conditional information entropy and D-S evidence theory

A new method for feature selection based on weighted [formula omitted]-nearest neighborhood rough set

GRRS: Accurate and Efficient Neighborhood Rough Set for Feature Selection

An Emerging Fuzzy Feature Selection Method Using Composite Entropy-Based Uncertainty Measure and Data Distribution

A soft neighborhood rough set model and its applications

Multi-label feature selection based on label distribution and neighborhood rough set

Noise-resistant multilabel fuzzy neighborhood rough sets for feature subset selection

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Neighborhood Rough Set Model Research Articles

Related Topics

Articles published on Neighborhood Rough Set Model

Local fuzzy rough attribute reduction for large-scale mixed data with limited missing labels based on local fuzzy self information

Neighborhood margin rough set: Self-tuning neighborhood threshold

Feature selection based on neighborhood complementary entropy for heterogeneous data

A novel adaptive neighborhood rough sets based on sparrow search algorithm and feature selection

Multi-label feature selection using self-information in divergence-based fuzzy neighborhood rough sets

Fusing multiple interval-valued fuzzy monotonic decision trees

Attribute reduction for heterogeneous data based on monotonic relative neighborhood granularity

Distance metric learning-based multi-granularity neighborhood rough sets for attribute reduction

Adaptive neighborhood rough set model for hybrid data processing: a case study on Parkinson’s disease behavioral analysis

Dynamic multi-label feature selection algorithm based on label importance and label correlation

A local rough set method for feature selection by variable precision composite measure

Heterogeneous Feature Selection Based on Neighborhood Combination Entropy.

Feature Selection for Unbalanced Distribution Hybrid Data Based on ${k}$-Nearest Neighborhood Rough Set

Information fusion and attribute reduction for multi-source incomplete mixed data via conditional information entropy and D-S evidence theory

A new method for feature selection based on weighted [formula omitted]-nearest neighborhood rough set

GRRS: Accurate and Efficient Neighborhood Rough Set for Feature Selection

An Emerging Fuzzy Feature Selection Method Using Composite Entropy-Based Uncertainty Measure and Data Distribution

A soft neighborhood rough set model and its applications

Multi-label feature selection based on label distribution and neighborhood rough set

Noise-resistant multilabel fuzzy neighborhood rough sets for feature subset selection