New uncertainty measurement for categorical data based on fuzzy information structures: An application in attribute reduction

Qinli Zhang, Yiying Chen, Gangqiang Zhang, Zhaowen Li, Lijun Chen, Ching-Feng Wen

doi:10.1016/j.ins.2021.08.089

Abstract

Categorical data is a significant kind of data in machine learning. Generally, rough set theory (RS-theory) deals with categorical data in the following way. First, an equivalence relation based on the equality of attribute values of categorical data is established. Then, information granules (I-granules) based on equivalence classes are obtained. Finally, information structures (I-structures) consisting of I-granules are formed. However, an equivalence relation is too strict, and there are some limitations in the I-structure of a categorical information system (CIS) that may result in filtering out potentially useful information. This paper investigates fuzzy information structures (FI-structures) and new uncertainty measurements for categorical data from the perspective that “the equality of attribute values is fed back to the attribute set”. First, a fuzzy symmetry relation based on the number of attributes with equal attribute values is established. Then, fuzzy information granules (FI-granules) based on the fuzzy symmetry relation are obtained. Next, FI-structures consisting of FI-granules are formed. Finally, some concepts related to FI-structures in a CIS are given. The set vector is used to denote FI-structures, and the inclusion degree is used to study the dependence between FI-structures. In addition, four new uncertainty measurements based on FI-structures in a CIS are proposed, including fuzzy information granulation (Gf), fuzzy information entropy (Hf), fuzzy rough entropy (Erf) and fuzzy information amount (Ef). Moreover, numerical experiments and statistical tests to evaluate the performance of the proposed new measurements are carried out. The results of the paired t-test show that the performance of the four new measurements based on FI-structures is better than that of the corresponding four measurements based on I-structures. Finally, attribute reduction algorithms based on Gf and Hf are presented, and clustering analysis is conducted on the reduced CIS. The experimental results show that the proposed algorithms are effective and perform well on attribute reduction according to three evaluation indicators of clustering performance.
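The abstract does not give formulas, so the following is only a minimal sketch of the idea it describes: a fuzzy relation built from the number of attributes on which two objects agree, with two uncertainty measures computed from it. The normalisation by the attribute count and the specific forms used for granulation and entropy are assumptions borrowed from common definitions in the fuzzy rough set literature, not necessarily the paper's own. All function names and the toy table are hypothetical.

```python
import math

def fuzzy_relation(table, attrs):
    """R[i][j] = fraction of the attributes in `attrs` on which objects i and j
    take equal values (assumed normalisation; always symmetric and reflexive)."""
    n = len(table)
    return [
        [sum(table[i][a] == table[j][a] for a in attrs) / len(attrs)
         for j in range(n)]
        for i in range(n)
    ]

def granulation(rel):
    """Assumed form of fuzzy information granulation:
    the mean over objects of |R(x_i)| / n, where |R(x_i)| is the row sum."""
    n = len(rel)
    return sum(sum(row) for row in rel) / (n * n)

def entropy(rel):
    """Assumed form of fuzzy information entropy:
    -(1/n) * sum_i log2(|R(x_i)| / n). Row sums are >= 1 because R(x, x) = 1."""
    n = len(rel)
    return -sum(math.log2(sum(row) / n) for row in rel) / n

# Hypothetical toy categorical information system: 3 objects, 3 attributes.
table = [
    ("red", "small", "yes"),
    ("red", "large", "yes"),
    ("blue", "large", "no"),
]
rel = fuzzy_relation(table, attrs=[0, 1, 2])
g = granulation(rel)
h = entropy(rel)
```

With a single attribute, every entry of the relation is 0 or 1, so the sketch reduces to the crisp equivalence relation of classical rough set theory. With several attributes, partial agreement between objects is kept as a membership degree rather than discarded.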

Journal: Information Sciences
Publication Date: Aug 29, 2021
Citations: 18

Similar Papers

Feature selection for multiset-valued data based on fuzzy conditional information entropy using iterative model and matrix operation
Dan Huang ... Zhaowen Li
Applied Soft Computing | VOL. 142
26 Apr 2023

Hierarchical structures and uncertainty measures for intuitionistic fuzzy approximation space
Bing Huang ... Xian-Zhong Zhou
Information Sciences | VOL. 336
14 Dec 2015

Fuzzy information entropy-based adaptive approach for hybrid feature outlier detection
Zhong Yuan ... Shu Wang
Fuzzy Sets and Systems | VOL. 421
04 Nov 2020

Fuzzy complementary entropy using hybrid-kernel function and its unsupervised attribute reduction
Zhong Yuan ... Keyu Liu
Knowledge-Based Systems | VOL. 231
23 Aug 2021
