Increasing Data Set Incompleteness May Improve Rule Set Quality

Jerzy W Grzymala-Busse,Witold J Grzymala-Busse

doi:10.1007/978-3-642-05201-9_16

Abstract

This paper presents a new methodology to improve the quality of rule sets. We performed a series of data mining experiments on completely specified data sets. In these experiments we removed some specified attribute values, or, in different words, replaced such specified values by symbols of missing attribute values, and used these data for rule induction while original, complete data sets were used for testing. In our experiments we used the MLEM2 rule induction algorithm of the LERS data mining system, based on rough sets. Our approach to missing attribute values was based on rough set theory as well. Results of our experiments show that for some data sets and some interpretation of missing attribute values, the error rate was smaller than for the original, complete data sets. Thus, rule sets induced from some data sets may be improved by increasing incompleteness of data sets. It appears that by removing some attribute values, the rule induction system, forced to induce rules from remaining information, may induce better rule sets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Increasing Data Set Incompleteness May Improve Rule Set Quality

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Knowledge acquisition in incomplete fuzzy information systems via the rough set approach
Wei‐Zhi Wu ... Wen‐Xiu Zhang
Expert Systems | VOL. 20
Wei‐Zhi Wu, et. al.Wei‐Zhi Wu ... Wen‐Xiu Zhang
10 Oct 2003
Expert Systems | VOL. 20

Learning rules from incomplete training examples by rough sets
T Hong ... S Wang
Expert Systems with Applications | VOL. 22
T Hong, et. al.T Hong ... S Wang
12 Feb 2002
Expert Systems with Applications | VOL. 22

LERS—A Data Mining System
Jerzy W Grzymala-Busse
-
Jerzy W Grzymala-BusseJerzy W Grzymala-Busse
01 Jan 2004
01 Jan 2004

Mining from incomplete quantitative data by fuzzy rough sets
Tzung-Pei Hong ... Been-Chian Chien
Expert Systems With Applications | VOL. 37
Tzung-Pei Hong, et. al.Tzung-Pei Hong ... Been-Chian Chien
20 Aug 2009
Expert Systems With Applications | VOL. 37

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Increasing Data Set Incompleteness May Improve Rule Set Quality

Abstract

Talk to us

Similar Papers