Determining the Number of Iterations Appropriate for Ensemble Gene Selection on Microarray Data

David J Dittman,Amri Napolitano,Randall Wald,Taghi M Khoshgoftaar

doi:10.1109/icmla.2012.23

Abstract

Ensemble gene (feature) selection is a promising new strategy with many benefits including more stable gene lists and improved classification results. The ensemble portion is achieved through multiple runs of feature selection which are then aggregated into a single result. The critical question is how many iterations of feature selection are appropriate. Too few iterations can make classification performance suffer. However, too many iterations will cause issues regarding computational costs. The goal is to choose the correct number of iterations to maximize classification performance without expending too much computational power. Our paper is an in-depth study on the effect of the number of iterations of feature selection on classification performance. Our work employs eleven DNA microarray datasets, on which we apply various ensemble methods, feature selection techniques, classifiers, and feature subset sizes. The results show that using 10 iterations of feature ranking during ensemble feature selection is not sufficient to optimize classification results and that a larger number of iterations is required (20 or 50). However, there is very little distinction between 20 iterations and 50 iterations, as both produce very similar classification results. Our recommendation is to use 20 iterations because while 20 iterations and 50 iterations perform similarly, 20 iterations has a much smaller computation time. To our knowledge there has not been a previous study as expansive as this one on the effects of the number of iterations of feature selection on ensemble feature selection.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Determining the Number of Iterations Appropriate for Ensemble Gene Selection on Microarray Data

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Feature list aggregation approaches for ensemble gene selection on patient response datasets
Taghi M Khoshgoftaar ... Amri Napolitano
-
Taghi M Khoshgoftaar, et. al.Taghi M Khoshgoftaar ... Amri Napolitano
01 Aug 2013
01 Aug 2013

The Effect of Number of Iterations on Ensemble Gene Selection
Wael Awada ... David Dittman
-
Wael Awada, et. al.Wael Awada ... David Dittman
01 Dec 2012
01 Dec 2012

Feature selection and its combination with data over-sampling for multi-class imbalanced datasets
Chih-Fong Tsai ... Wei-Chao Lin
Applied Soft Computing | VOL. 153
Chih-Fong Tsai, et. al.Chih-Fong Tsai ... Wei-Chao Lin
17 Jan 2024
Applied Soft Computing | VOL. 153

Comparing Two New Gene Selection Ensemble Approaches with the Commonly-Used Approach
David J Dittman ... Amri Napolitano
-
David J Dittman, et. al.David J Dittman ... Amri Napolitano
01 Dec 2012
01 Dec 2012

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Determining the Number of Iterations Appropriate for Ensemble Gene Selection on Microarray Data

Abstract

Talk to us

Similar Papers