Abstract

AbstractDNA microarray analysis plays a prominent role in classifying genes related to cancer. The dimension of the data is high and difficult to handle during classification. Hence, the dimension has to be reduced and highly predictive gene features must be obtained without affecting the accuracy. Previous studies concentrated either on improving the classification accuracy or reduction of gene features. Here, the multi‐objective problem of obtaining reduced gene features with high classification accuracy is addressed using the proposed correlation feature selection filter and binary bat algorithm (BBA) with greedy crossover. The gene feature subsets are obtained using the correlation based feature selection filter and optimized using the BBA. Suboptimal solutions obtained due to pre‐convergence of BBA are reset using the proposed greedy crossover. Highly predictive genes features are obtained and evaluated with support vector machine 10‐fold cross‐validation. An average classification accuracy of 95.85% with predictive gene features <1% of the total dataset was obtained when applied on cancer microarray datasets. The solution for the multi‐objective problem of obtaining high classification accuracy with minimal number of genes is achieved with better performance over the existing algorithms. Also, the problem of pre‐convergence with suboptimal solutions in optimization algorithms is overcome.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call