Abstract

In supervised learning, feature selection methods identify the most relevant predictors to include in a model. For linear models, the inclusion or exclusion of each variable may be represented as a vector of bits playing the role of the genetic material that defines the model. Genetic algorithms reproduce the strategies of natural selection on a population of models to identify the best one. We derive the distribution of the importance scores for parallel genetic algorithms under the null hypothesis that none of the features has predictive power. This distribution hence provides an objective threshold for feature selection that does not require the visual inspection of a bubble plot. We also introduce the eradication strategy, akin to forward stepwise selection, in which the genes of useful variables are sequentially forced into the models. The method is illustrated on real data, and simulation studies are run to assess its performance.
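To make the bit-vector encoding concrete, the following is a minimal sketch, not the authors' implementation, of genetic-algorithm feature selection for a linear model. Each genome is a 0/1 vector flagging which predictors enter the regression; the fitness used here (BIC of the least-squares fit) and the importance score (a variable's inclusion frequency in the final population) are illustrative assumptions, as are all parameter values.

```python
# Hypothetical sketch of GA-based feature selection for a linear model.
# Fitness = BIC (assumed, not necessarily the paper's criterion).
import numpy as np

rng = np.random.default_rng(0)

def bic(X, y, genome):
    """BIC of the least-squares fit using only the columns flagged in genome."""
    cols = np.flatnonzero(genome)
    n = len(y)
    Xs = np.column_stack([np.ones(n)] + [X[:, j] for j in cols])
    beta, *_ = np.linalg.lstsq(Xs, y, rcond=None)
    rss = np.sum((y - Xs @ beta) ** 2)
    return n * np.log(rss / n) + (len(cols) + 1) * np.log(n)

def evolve(X, y, pop_size=50, n_gen=100, p_mut=0.05):
    p = X.shape[1]
    pop = rng.integers(0, 2, size=(pop_size, p))  # random initial genomes
    for _ in range(n_gen):
        fit = np.array([bic(X, y, g) for g in pop])   # lower BIC = fitter
        parents = pop[np.argsort(fit)[: pop_size // 2]]  # truncation selection
        kids = []
        while len(kids) < pop_size - len(parents):
            a, b = parents[rng.integers(len(parents), size=2)]
            cut = rng.integers(1, p)                  # one-point crossover
            child = np.concatenate([a[:cut], b[cut:]])
            flip = rng.random(p) < p_mut              # bitwise mutation
            child[flip] = 1 - child[flip]
            kids.append(child)
        pop = np.vstack([parents, kids])
    fit = np.array([bic(X, y, g) for g in pop])
    return pop, fit

# Toy data: 10 predictors, only the first 3 truly matter.
n, p = 200, 10
X = rng.standard_normal((n, p))
y = X[:, 0] + 0.8 * X[:, 1] - 0.5 * X[:, 2] + rng.standard_normal(n)
pop, fit = evolve(X, y)
print(pop.mean(axis=0).round(2))          # per-variable inclusion frequency
print("best genome:", pop[np.argmin(fit)])
```

Under the paper's null hypothesis that no feature is predictive, such inclusion frequencies would fluctuate around a known distribution, which is what motivates an objective selection threshold in place of visual inspection.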


