Abstract

Increasing the number of features increases the complexity of a model even when the additional features do not improve its decision-making capacity. Irrelevant features may also cause overfitting and reduce the interpretability of the model. It is, therefore, important that features are optimally selected before a model is built. In online learning, new instances are periodically discovered, and the model is tactically retrained as required. Similarly, in many real-life situations hundreds of new features are discovered periodically, and the existing model must be retrained or tested for performance improvement. Supervised selection of a feature subset usually requires building multiple suboptimal models, incurring time-intensive computation. Unsupervised selection, although faster, largely relies on some subjective definition of feature relevance. In this paper, we introduce a score that accurately determines the importance of features. The proposed score is well suited to online feature selection owing to its low time complexity and its ability to indicate whether a newly added feature would improve the current model, without invoking retraining.
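The abstract does not specify the score itself, but the general idea of a filter-style score, one that rates a newly discovered feature without retraining the model, can be sketched as follows. This is a hypothetical illustration, not the paper's method: it uses a simple relevance-minus-redundancy criterion based on Pearson correlation, where relevance is the feature's correlation with the target and redundancy is its maximum correlation with any already-selected feature.

```python
import numpy as np

def feature_score(X_selected, x_new, y):
    """Rate a candidate feature without retraining any model.

    Relevance  : |corr(x_new, y)| -- how much the feature tracks the target.
    Redundancy : max over selected features j of |corr(x_new, X_selected[:, j])|.
    Score      : relevance - redundancy (a generic filter criterion,
                 assumed here for illustration; not the paper's score).
    """
    def abs_corr(a, b):
        return abs(np.corrcoef(a, b)[0, 1])

    relevance = abs_corr(x_new, y)
    redundancy = max(abs_corr(x_new, X_selected[:, j])
                     for j in range(X_selected.shape[1]))
    return relevance - redundancy

# Synthetic online-learning scenario: two features are already selected,
# and three new candidate features arrive.
rng = np.random.default_rng(0)
n = 500
x0 = rng.normal(size=n)
x1 = rng.normal(size=n)
z = rng.normal(size=n)                 # hidden signal not yet in the model
y = x0 + z                             # target depends on x0 and z
X = np.column_stack([x0, x1])          # features already in the model

redundant = x0 + 0.05 * rng.normal(size=n)  # near-duplicate of x0
novel = z + 0.1 * rng.normal(size=n)        # carries new information about y
noise = rng.normal(size=n)                  # irrelevant

print(feature_score(X, redundant, y))  # negative: relevant but redundant
print(feature_score(X, noise, y))      # near zero: irrelevant
print(feature_score(X, novel, y))      # clearly positive: worth adding
```

Each evaluation costs one pass over the data per selected feature, so scoring a new feature is far cheaper than retraining, which is the property the abstract highlights.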
