Abstract

The rapid development of computer and database technologies has led to rapid growth in large-scale datasets. This poses an important challenge for data mining applications known as the curse of dimensionality, in which the number of features is much higher than the number of patterns. Feature selection is one dimensionality reduction approach that can increase the accuracy of these applications and reduce their computational complexity. This paper proposes a novel feature selection method to reduce dimensionality and computational complexity in high-dimensional data processing. First, a probabilistic strategy metric based on centrality and the Fisher score is proposed to measure the influence of features. Second, a new discriminant function is proposed to determine whether a feature should be selected; it automatically calculates weight parameters to balance the relevance of a feature to the class labels against the redundancy of the selected feature subset. Finally, a new method named MTC_FS is proposed by combining the maximal information coefficient (MIC), total information (TI), and the centrality technique. The experimental results show that the average accuracy of MTC_FS improves by 1.8% over the best baseline, and that the comprehensive performance of MTC_FS is superior to all baselines on 12 public datasets. MTC_FS also has a shorter runtime than all baselines on every dataset. In addition, the performance of MTC_FS is the most stable on the NBayes classifier.
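As an illustration of one ingredient mentioned above, the Fisher score ranks each feature by its between-class scatter relative to its within-class scatter. The sketch below is only a minimal, standard Fisher-score computation; it does not reproduce the paper's full MTC_FS method (the centrality, MIC, and total-information components are omitted), and the function name and the epsilon guard are our own choices.

```python
import numpy as np

def fisher_score(X, y):
    """Per-feature Fisher score: between-class scatter / within-class scatter.

    Illustrative sketch only; the paper's MTC_FS additionally combines MIC,
    total information, and a centrality measure, which are not shown here.
    """
    classes = np.unique(y)
    overall_mean = X.mean(axis=0)
    between = np.zeros(X.shape[1])   # weighted squared distance of class means
    within = np.zeros(X.shape[1])    # weighted within-class variance
    for c in classes:
        Xc = X[y == c]
        n_c = Xc.shape[0]
        between += n_c * (Xc.mean(axis=0) - overall_mean) ** 2
        within += n_c * Xc.var(axis=0)
    # small epsilon guards against division by zero for constant features
    return between / np.maximum(within, 1e-12)

# Toy data: feature 0 separates the two classes, feature 1 is noisy.
X = np.array([[1.0, 5.0], [1.1, 3.0], [3.0, 4.9], [3.1, 2.8]])
y = np.array([0, 0, 1, 1])
scores = fisher_score(X, y)
ranking = np.argsort(scores)[::-1]  # indices of features, most discriminative first
```

In a selection pipeline such a score would typically serve as the relevance term, with a separate redundancy term (MIC between candidate and already-selected features in the paper's case) penalizing correlated picks.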

