A novel self-learning feature selection approach based on feature attributions

Jianting Chen, Shuhan Yuan, Dongdong Lv, Yang Xiang

doi:10.1016/j.eswa.2021.115219

Abstract

Feature selection has proven effective at improving the accuracy and generalization of machine learning models, especially for tasks with high-dimensional data. In this article, we propose a novel self-learning feature selection (SLFS) approach based on feature attributions. SLFS is a wrapper method that searches for optimal feature subsets more efficiently, owing to three main improvements. First, we regard feature selection as a combinatorial optimization problem and, by analyzing meta-heuristic algorithms in feature selection, propose a unified local search framework for wrapper methods. Second, for the binary search space of feature selection, we propose two types of neighborhood structures for the local search framework: ring-type and line-type. Third, we use feature attribution methods, such as SHAP (SHapley Additive exPlanations) (Lundberg & Lee, 2017), which quantify each feature’s importance to predictions. In each iteration, SHAP values and other attributes from previous subsets guide the selection of new subsets. To validate the SLFS approach, we collected 16 classification datasets from the UCI repository and compared it with other meta-heuristic wrapper approaches in terms of fitness, accuracy, F1 score, and selection ratio. The experimental results show that the SLFS approach can obtain an optimal subset with fewer iterations and a small population, and that SHAP values help improve search efficiency.
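The general idea of letting attributions from earlier subsets steer the next candidate subset can be illustrated with a small toy sketch. This is not the SLFS algorithm from the paper: the abstract does not define the ring-type and line-type neighborhoods, so a plain single-bit-flip neighborhood is used instead, and no population is kept. The names `toy_fitness`, `toy_attribution`, `flip`, and `attribution_guided_search` are hypothetical, as is the set `INFORMATIVE`. In a real wrapper, the fitness would come from training and validating a classifier on the selected features, and the attribution from something like the mean absolute SHAP value per feature.

```python
# Toy illustration only: not the SLFS algorithm from the paper.
INFORMATIVE = {1, 4, 6}  # hypothetical "useful" features in an 8-feature toy problem


def toy_fitness(mask):
    """Lower is better: toy error term plus a small penalty on the selection ratio."""
    missing = sum(1 for j in INFORMATIVE if not mask[j])
    error = missing / len(INFORMATIVE)   # stand-in for validation error
    ratio = sum(mask) / len(mask)        # fraction of features selected
    return error + 0.1 * ratio


def toy_attribution(mask):
    """Stand-in for per-feature mean |SHAP| of a model trained on this subset."""
    return {j: (1.0 if j in INFORMATIVE else 0.0)
            for j, on in enumerate(mask) if on}


def flip(mask, j):
    """Return a copy of the binary mask with bit j flipped."""
    out = list(mask)
    out[j] = not out[j]
    return out


def attribution_guided_search(start, fitness, attribute, n_iters=20):
    """Bit-flip local search whose moves are chosen from remembered attributions."""
    current, current_fit = list(start), fitness(start)
    memory = dict(attribute(current))  # feature index -> last observed attribution
    for _ in range(n_iters):
        selected = [j for j, on in enumerate(current) if on]
        unselected = [j for j, on in enumerate(current) if not on]
        neighbors = []
        if len(selected) > 1:
            # Try dropping the selected feature with the lowest attribution.
            worst = min(selected, key=lambda j: memory.get(j, 0.0))
            neighbors.append(flip(current, worst))
        if unselected:
            # Try adding the most promising unselected feature;
            # features never evaluated are treated optimistically.
            best = max(unselected, key=lambda j: memory.get(j, float("inf")))
            neighbors.append(flip(current, best))
        scored = []
        for cand in neighbors:
            memory.update(attribute(cand))  # learn from every evaluated subset
            scored.append((fitness(cand), cand))
        if scored:
            fit, cand = min(scored, key=lambda t: t[0])
            if fit < current_fit:           # accept only strict improvements
                current, current_fit = cand, fit
    return current, current_fit


start = [True, False, False, True, False, False, False, True]
best_mask, best_fit = attribution_guided_search(start, toy_fitness, toy_attribution)
print([j for j, on in enumerate(best_mask) if on], best_fit)
```

Because attributions observed on rejected candidates are also remembered, a feature that contributed nothing when tried is not proposed again ahead of untried ones, which is one simple way that knowledge from previous subsets can reduce wasted evaluations.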


Journal: Expert Systems with Applications
Publication Date: Jun 7, 2021
Citations: 20

Similar Papers

Feature selection strategies: a comparative analysis of SHAP-value and importance-based methods
Huanjing Wang ... Taghi M Khoshgoftaar
Journal of Big Data | VOL. 11
26 Mar 2024

Establishment and interpretation of the gamma pass rate prediction model based on radiomics for different intensity-modulated radiotherapy techniques in the pelvis
Qianxi Ni ... Xiangshang Sun
Frontiers in Physics | VOL. 11
10 Aug 2023

Differences in trajectory of disease activity according to biologic and targeted synthetic disease-modifying anti-rheumatic drug treatment in patients with rheumatoid arthritis
Bon San Koo ... Bin Yoo
Arthritis Research & Therapy | VOL. 24
01 Jan 2021

A machine learning model to predict treatment initiation among new patients in a community oncology network
Bo He ... Jody S Garey
Journal of Clinical Oncology | VOL. 41
01 Jun 2023
