Abstract

Attribute reduction is one of the most important problems in rough set theory. Conventional attribute reduction algorithms minimize error on seen objects, i.e., they follow empirical risk minimization. In practical applications, however, classification ability on unseen objects, i.e., generalization ability, matters more, so a good reduct should generalize well. The structural risk minimization (SRM) inductive principle is an effective tool for controlling the generalization ability of learning machines, as it considers model complexity and error on seen objects simultaneously. This paper therefore introduces the SRM principle into the definition of attribute significance, proposes that the number of rules effectively characterizes the actual complexity of a rough set-based classifier, and defines a novel complexity-weighted measure of attribute significance. Based on this new measure, a heuristic attribute reduction algorithm called HSRM-R is developed. Ten-fold cross-validation experiments on 21 UCI datasets show that HSRM-R achieves better generalization ability than conventional attribute reduction algorithms based on dependency degree, information entropy, Fisher score, and Laplacian score. Further experiments show that HSRM-R produces fewer rules with a larger support coefficient; that is, it extracts stronger rules, which partly explains its better generalization ability. Although HSRM-R consumes more time than the conventional algorithms, it attains the best classification accuracy on almost all datasets used in the experiments. The proposed HSRM-R algorithm thus offers a theoretically grounded way to guarantee generalization ability when users require high classification accuracy.
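
As a rough illustration of the complexity-weighted idea described above, the following Python sketch performs greedy forward attribute selection in which each candidate attribute's significance is its gain in dependency degree minus a penalty proportional to the increase in the number of induced rules. The quality measure, the penalty weight alpha, and the stopping rule are illustrative assumptions; the exact HSRM-R significance definition is not given in this abstract.

# A minimal sketch of complexity-weighted heuristic attribute reduction,
# assuming a dependency-degree quality measure and using the number of
# decision rules (distinct condition-value patterns) as the complexity term.
# The weighting scheme and stopping rule are illustrative assumptions,
# not the paper's exact HSRM-R definitions.
from collections import defaultdict

def partition(data, attrs):
    """Group row indices by their value pattern on the given attributes."""
    blocks = defaultdict(list)
    for i, row in enumerate(data):
        blocks[tuple(row[a] for a in attrs)].append(i)
    return list(blocks.values())

def dependency_degree(data, attrs, decision):
    """Fraction of objects whose block is consistent with the decision (positive region)."""
    if not attrs:
        return 0.0
    consistent = 0
    for block in partition(data, attrs):
        labels = {data[i][decision] for i in block}
        if len(labels) == 1:
            consistent += len(block)
    return consistent / len(data)

def rule_count(data, attrs):
    """Number of distinct condition patterns, used here as a proxy for classifier complexity."""
    return len(partition(data, attrs)) if attrs else 1

def srm_reduct(data, cond_attrs, decision, alpha=0.1):
    """Greedy forward selection: at each step add the attribute with the best
    significance = dependency-degree gain - alpha * normalized rule-count increase."""
    reduct, current = [], 0.0
    target = dependency_degree(data, cond_attrs, decision)
    while current < target:
        best_attr, best_sig = None, float("-inf")
        for a in cond_attrs:
            if a in reduct:
                continue
            cand = reduct + [a]
            gain = dependency_degree(data, cand, decision) - current
            complexity = (rule_count(data, cand) - rule_count(data, reduct)) / len(data)
            sig = gain - alpha * complexity
            if sig > best_sig:
                best_attr, best_sig = a, sig
        reduct.append(best_attr)
        current = dependency_degree(data, reduct, decision)
    return reduct

if __name__ == "__main__":
    # Toy decision table: columns 0-2 are condition attributes, column 3 is the decision.
    table = [
        (1, 0, 1, "yes"),
        (1, 1, 1, "yes"),
        (0, 0, 1, "no"),
        (0, 1, 0, "no"),
        (1, 1, 0, "yes"),
    ]
    print(srm_reduct(table, cond_attrs=[0, 1, 2], decision=3))  # e.g. [0]

On this toy table the sketch selects a single attribute, since it alone yields full dependency while adding the fewest rules; the same trade-off between accuracy gain and rule-count growth is what the complexity-weighted significance is meant to capture.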
