Attribute reduction is widely employed to improve the efficiency and accuracy of data analysis by eliminating redundant and irrelevant attributes from datasets. However, as datasets grow to big-data scale, sequential execution of such algorithms becomes time-consuming, and scalable parallelization requires distributed computing. This study proposes a novel attribute reduction algorithm for neighborhood decision systems. We introduce two new metrics, the neighborhood evidential conflict degree (NECD) and the neighborhood evidential conflict rate (NECR), which quantify the heterogeneity between samples within a neighborhood and the significance of attributes in the feature space, respectively. These metrics guide the evaluation and selection of attribute subsets during reduction, improving both classification accuracy and computational efficiency. We also develop a sequential forward selection attribute reduction method that selects a feature subset guided by the defined NECR. Finally, we implement a distributed attribute reduction algorithm on Apache Spark. Our approach uses a two-phase MapReduce process for K-nearest-neighbor search, evidence combination, and NECR computation. As a measure of feature-subset quality, the NECR strengthens the subset's ability to approximate the decision structure of the data. Experimental results on small and large datasets demonstrate that the proposed algorithm outperforms benchmark algorithms in both classification accuracy and computational efficiency.
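To make the NECR-guided reduction step concrete, the following is a minimal sketch of a sequential forward selection loop. It assumes a user-supplied `necr(subset, X, y)` function that returns the neighborhood evidential conflict rate of a candidate attribute subset (lower is better) and a stopping threshold `epsilon`; these names, the stopping rule, and the signature are illustrative assumptions, not the paper's exact formulation, and the distributed KNN search and evidence combination performed in Spark are abstracted away.

```python
# Illustrative sketch of NECR-guided sequential forward selection.
# Assumptions (not taken from the abstract): necr(subset, X, y) computes the
# neighborhood evidential conflict rate of `subset` on data (X, y), with lower
# values indicating a better subset; `epsilon` is a stopping threshold.

def forward_selection(all_attributes, X, y, necr, epsilon=1e-4):
    """Greedily add the attribute that most reduces NECR until no
    candidate improves the rate by more than `epsilon`."""
    selected = []
    remaining = list(all_attributes)
    current_rate = float("inf")  # no subset selected yet

    while remaining:
        # Score each candidate subset selected + [a] by its NECR.
        scored = [(necr(selected + [a], X, y), a) for a in remaining]
        best_rate, best_attr = min(scored)

        # Stop when the best candidate no longer improves NECR meaningfully.
        if current_rate - best_rate <= epsilon:
            break

        selected.append(best_attr)
        remaining.remove(best_attr)
        current_rate = best_rate

    return selected
```

In the distributed setting described above, each `necr` evaluation would itself be computed via the two-phase MapReduce pipeline (KNN search, evidence combination, NECR aggregation) rather than on a single machine.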