Maximal Consistent Blocks Research Articles

In this paper, we discuss a rough set approach to missing attribute values. Among many ways of interpreting missing values, in this paper we focus on two interpretations, lost values and “do not care” conditions. Using these interpretations, global and saturated probabilistic approximations are constructed with two types of granules: characteristic sets and maximal consistent blocks. We compare eight approaches, combining two interpretations of missing attribute values, two types of probabilistic approximations with two types of granules using an error rate that is computed as a result of ten-fold cross-validation. Using a 5% level of statistical significance, we present the experimental results for these eight approaches, showing statistically significant differences between all approaches to mining incomplete data. The results also show that no one method and approach is the best for every data set and that all eight approaches should be attempted. The final section of the paper presents the idea of concept-compatible data sets. We show that for these types of data sets, global and saturated probabilistic approximations for a concept are identical to the concept. We also show that for an incomplete data set with no duplicate rows using the lost interpretation of missing attribute values, the data set is concept-compatible.

Read full abstract

Abstract In this paper incomplete data sets, or data sets with missing attribute values, have three interpretations, lost values, attribute-concept values and ‘do not care’ conditions. Additionally, the process of data mining is based on two types of probabilistic approximations, global and saturated. We present results of experiments on mining incomplete data sets using six approaches, combining three interpretations of missing attribute values with two types of probabilistic approximations. We compare our six approaches, using the error rate computed as a result of ten-fold cross validation as a criterion of quality. We show that for some data sets the error rate is significantly smaller (5% level of significance) for lost values, for some data sets the smaller error rate is associated with attribute-concept values, and sometimes with ‘do not care’ conditions. Again, for some approaches the error rate is significantly smaller for saturated probabilistic approximations than for global probabilistic approximations, while for some approaches it is the other way around. Thus, for an incomplete data set, the best approach to data mining should be chosen by trying all six approaches.

Read full abstract

Maximal Consistent Blocks Research Articles

Related Topics

Articles published on Maximal Consistent Blocks

Mining incomplete data using global and saturated probabilistic approximations based on characteristic sets and maximal consistent blocks

Fuzzy and rough approach to the problem of missing data in fall detection system

Handling the Complexity of Computing Maximal Consistent Blocks

TFD-IIS-CRMCB: Telecom Fraud Detection for Incomplete Information Systems Based on Correlated Relation and Maximal Consistent Block

Global and saturated probabilistic approximations based on generalized maximal consistent blocks

A New Approach to Constructing Maximal Consistent Blocks for Mining Incomplete Data

Complexity of rule sets in mining incomplete data using characteristic sets and generalized maximal consistent blocks

Complexity of Rule Sets Mined from Incomplete Data Using Probabilistic Approximations Based on Generalized Maximal Consistent Blocks

Characteristic sets and generalized maximal consistent blocks in mining incomplete data

Tolerance-based multigranulation rough sets in incomplete systems

Neighborhood Systems Based Rough Sets in Grey Fuzzy Information System

NEIGHBORHOOD SYSTEM BASED ROUGH SET: MODELS AND ATTRIBUTE REDUCTIONS

Consistency measure, inclusion degree and fuzzy measure in decision tables

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Maximal Consistent Blocks Research Articles

Related Topics

Articles published on Maximal Consistent Blocks

Mining incomplete data using global and saturated probabilistic approximations based on characteristic sets and maximal consistent blocks

Fuzzy and rough approach to the problem of missing data in fall detection system

Handling the Complexity of Computing Maximal Consistent Blocks

TFD-IIS-CRMCB: Telecom Fraud Detection for Incomplete Information Systems Based on Correlated Relation and Maximal Consistent Block

Global and saturated probabilistic approximations based on generalized maximal consistent blocks

A New Approach to Constructing Maximal Consistent Blocks for Mining Incomplete Data

Complexity of rule sets in mining incomplete data using characteristic sets and generalized maximal consistent blocks

Complexity of Rule Sets Mined from Incomplete Data Using Probabilistic Approximations Based on Generalized Maximal Consistent Blocks

Characteristic sets and generalized maximal consistent blocks in mining incomplete data

Tolerance-based multigranulation rough sets in incomplete systems

Neighborhood Systems Based Rough Sets in Grey Fuzzy Information System

NEIGHBORHOOD SYSTEM BASED ROUGH SET: MODELS AND ATTRIBUTE REDUCTIONS

Consistency measure, inclusion degree and fuzzy measure in decision tables