A Comparative Study on Different Versions of Multi-Objective Genetic Algorithm for Simultaneous Gene Selection and Sample Categorization

Asit Kumar Das,Sunanda Das

doi:10.1007/978-981-13-1471-1_11

Abstract

Gene selection from microarray gene expression datasets and clustering of samples into different groups are important data mining tasks for disease identification. Selection of more interpretable genes from the gene expression dataset is an essential data-preprocessing task, which helps to study on cancer diseases. Gene selection during sample clustering is inherently a difficult task as there is no obvious criterion to guide the search. Simultaneous gene selection and sample clustering is a two-way data analysis technique which has recently gained attention in research area. The traditional clustering techniques are unable to handle noisy data properly. So, effective clustering algorithms are more desirable which can deal with the relevant and noise free data. Therefore, target genes selection before sample clustering is essential and of course effective if both the tasks are done simultaneously. In this chapter, optimal gene subset is selected and sample clustering is performed simultaneously using Multi-Objective Genetic Algorithm (MOGA). Different versions of MOGA are employed to choose the optimal gene subset, where natural number of optimal clusters of samples is automatically obtained at the end of the process. Non-dominated sorting genetic algorithm (NSGA), Strength pareto evolutionary algorithm (SPEA) and its modified version SPEA2 are applied for the purpose. The methods use nonlinear hybrid uniform cellular automata for generating initial population, tournament selection strategy, two-point crossover operation, and a suitable jumping gene mutation mechanism to maintain diversity in the population. It uses mutual correlation coefficient; internal and external cluster validation indices as objective functions to find out the non-dominated solutions. To measure the cluster validation indices, clustering algorithm is applied on data subset associated to chromosomes in the population to find out different clusters. After the convergence of genetic algorithm, the best solution from the non-dominated solutions is identified that provides the important genes and categorizes the samples into clusters. The experimental results express the correctness of the proposed simultaneous gene selection and sample categorization method. The goodness of optimality of the clusters obtained using different genetic algorithms is expressed by comparing various cluster validation indices.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Comparative Study on Different Versions of Multi-Objective Genetic Algorithm for Simultaneous Gene Selection and Sample Categorization

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Operational Optimization of WDS Based on Multiobjective Genetic Algorithms and Operational Extraction Rules Using Data Mining
Ivaltemir B Carrijo ... Luisa Fernanda R Reis
-
Ivaltemir B Carrijo, et. al.Ivaltemir B Carrijo ... Luisa Fernanda R Reis
25 Jun 2004
25 Jun 2004

Comparative Study on Multi-Objective Genetic Algorithms for Seismic Response Controls of Structures
Young-Jin Cha ... Yeesock Kim
-
Young-Jin Cha, et. al.Young-Jin Cha ... Yeesock Kim
01 Jan 2013
01 Jan 2013

Multi-objective hierarchical genetic algorithms for multilevel redundancy allocation optimization
Ranjan Kumar ... Shinji Nishiwaki
Reliability Engineering & System Safety | VOL. 94
Ranjan Kumar, et. al.Ranjan Kumar ... Shinji Nishiwaki
25 Oct 2008
Reliability Engineering & System Safety | VOL. 94

2 - Genetic algorithms and other heuristic techniques in power systems optimization
Juan Lujano-Rojas ... Rodolfo Dufo-López
Genetic Optimization Techniques for Sizing and Management of Modern Power Systems | VOL. -
Juan Lujano-Rojas, et. al.Juan Lujano-Rojas ... Rodolfo Dufo-López
30 Sep 2022
Genetic Optimization Techniques for Sizing and Management of Modern Power Systems | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Comparative Study on Different Versions of Multi-Objective Genetic Algorithm for Simultaneous Gene Selection and Sample Categorization

Abstract

Talk to us

Similar Papers