Abstract

Feature selection is an important preprocessing technique in data mining: it can improve the accuracy of data classification and shrink the feature space by eliminating redundant features. Because traditional feature selection algorithms suffer from high time complexity and low classification accuracy, an effective algorithm based on Information Gain and decision information is designed. The algorithm uses Information Gain to perform a preliminary dimensionality reduction on high-dimensional datasets, and decision information then serves as the evaluation function for selecting features that carry important information. First, the concept of the joint information granule is defined, and neighborhood information entropy measures are proposed on the basis of the joint information granule. In addition, the relationships among these measures are studied, which helps characterize the uncertainty in data. Second, a nonmonotonic algorithm using the decision information in the neighborhood information entropy measures is proposed to overcome the shortcomings of algorithms based on monotonic evaluation functions, thereby improving the accuracy of data classification. Third, to reduce the time cost of the designed algorithm on high-dimensional datasets, Information Gain is introduced to preliminarily eliminate irrelevant features. Finally, ablation and comparison experiments on twelve public datasets demonstrate the low time cost and the high classification accuracy of our algorithm, respectively.
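
To make the two-stage design concrete, the following is a minimal Python sketch of such a pipeline. It assumes a standard histogram-based Information Gain filter for stage one and, for stage two, a hypothetical neighborhood-consistency score that stands in for the paper's neighborhood-entropy-based decision information (whose exact definition appears in the full text, not in this abstract). The names `ig_filter`, `forward_select`, and the neighborhood radius `delta` are illustrative, not from the paper.

```python
import numpy as np
from collections import Counter


def entropy(labels):
    """Shannon entropy of a label vector."""
    counts = np.array(list(Counter(labels).values()), dtype=float)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))


def information_gain(feature, labels, bins=10):
    """Information Gain of one numeric feature, discretized into `bins` bins."""
    binned = np.digitize(feature, np.histogram_bin_edges(feature, bins=bins))
    cond = 0.0
    for v in np.unique(binned):
        mask = binned == v
        cond += mask.mean() * entropy(labels[mask])
    return entropy(labels) - cond


def ig_filter(X, y, keep=20):
    """Stage 1: preliminary reduction -- keep the `keep` highest-IG features."""
    gains = [information_gain(X[:, j], y) for j in range(X.shape[1])]
    return list(np.argsort(gains)[::-1][:keep])


def neighborhood_consistency(X_sub, y, delta=0.2):
    """Hypothetical evaluation standing in for the paper's decision
    information: the fraction of samples whose Euclidean delta-neighborhood
    (in the selected subspace) is pure with respect to the decision labels."""
    pure = 0
    for i in range(len(y)):
        d = np.linalg.norm(X_sub - X_sub[i], axis=1)
        pure += (y[d <= delta] == y[i]).all()
    return pure / len(y)


def forward_select(X, y, candidates, delta=0.2):
    """Stage 2: greedy forward search with a nonmonotonic evaluation --
    add the best candidate only while it strictly improves the score."""
    selected, best = [], 0.0
    while candidates:
        score, j = max(
            (neighborhood_consistency(X[:, selected + [j]], y, delta), j)
            for j in candidates
        )
        if score <= best:
            break  # no remaining candidate improves the evaluation; stop
        best = score
        selected.append(j)
        candidates.remove(j)
    return selected


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.random((200, 50))
    y = (X[:, 3] + X[:, 7] > 1.0).astype(int)  # only features 3 and 7 matter
    print(forward_select(X, y, ig_filter(X, y, keep=10)))
```

Because the stage-two search stops as soon as no candidate improves the evaluation rather than requiring the score to grow monotonically with subset size, it can discard features that a monotonic dependency function would be forced to keep, which is the behavior the abstract attributes to the nonmonotonic design.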
