Abstract

The widespread application of support vector machines (SVMs) calls for efficient methods of constructing an SVM classifier with high classification ability. The performance of an SVM crucially depends on whether an optimal feature subset and optimal SVM parameters can be obtained efficiently. In this paper, a coarse-grained parallel genetic algorithm (CGPGA) is used to simultaneously optimize the feature subset and the parameters of the SVM. The distributed topology and migration policy of the CGPGA help find the optimal feature subset and parameters in significantly less time, thereby increasing the quality of the solution found. In addition, a new fitness function, which combines the classification accuracy estimated by the bootstrap method, the number of chosen features, and the number of support vectors, is proposed to steer the search of the CGPGA toward minimal generalization error. Experimental results on 12 benchmark datasets show that the proposed approach outperforms a genetic algorithm (GA) based method and grid search in terms of classification accuracy, number of chosen features, number of support vectors, and running time.
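The abstract's fitness function combines three terms: bootstrap-estimated accuracy, the number of chosen features, and the number of support vectors. The sketch below shows one plausible weighted combination; the weights `w_a`, `w_f`, `w_s` and the exact form are assumptions for illustration, since the paper's formula is not given in this excerpt.

```python
def combined_fitness(accuracy, n_features, n_total_features,
                     n_sv, n_samples,
                     w_a=0.8, w_f=0.1, w_s=0.1):
    """Hypothetical weighted combination: reward high bootstrap
    accuracy, penalize large feature subsets and many support
    vectors. The weights w_a, w_f, w_s are illustrative, not
    values taken from the paper."""
    return (w_a * accuracy
            + w_f * (1 - n_features / n_total_features)
            + w_s * (1 - n_sv / n_samples))

# Example: 95% accuracy, 5 of 30 features, 40 SVs on 200 samples.
print(combined_fitness(0.95, 5, 30, 40, 200))
```

Penalizing the support-vector count is a common proxy for generalization error, since fewer support vectors typically indicate a simpler decision boundary.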

Highlights

  • The overwhelming amount of data now available in virtually every field gives researchers great opportunities to obtain knowledge that was previously out of reach

  • Despite all the promising results that SVMs have provided, it remains a challenge to efficiently construct an SVM classifier that gives accurate predictions on unseen samples. This so-called generalization ability crucially depends on two tasks, namely feature selection and parameter optimization [2,3,4]

  • Feature selection identifies the subset of available features that is most essential for classification

Summary

Introduction

The overwhelming amount of data now available in virtually every field gives researchers great opportunities to obtain knowledge that was previously out of reach. The trend in recent years has been to turn the two tasks of feature selection and parameter optimization into a single multiobjective optimization problem, so that global search algorithms such as the genetic algorithm (GA) [2, 14, 15], particle swarm optimization (PSO) [3], and ant colony optimization (ACO) [4] can perform them jointly. Jointly performing these two tasks results in a greatly enlarged solution space and demands strong search ability to find the optimal feature subset and parameters for the SVM. Moreover, since even a single SVM training run requires a great deal of computation, applying these global search algorithms becomes computationally infeasible in practice as the number of training samples grows.
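The joint search described above is commonly implemented by encoding both decisions in one chromosome: a bitstring whose first bits mask the features and whose remaining bits encode discretized SVM parameters (C and gamma for an RBF kernel). The following minimal, self-contained sketch shows that encoding with a plain serial GA; the fitness is a toy stand-in (rewarding smaller feature subsets), not the paper's bootstrap-based criterion, and all constants are illustrative assumptions.

```python
import random

random.seed(0)

N_FEATURES = 10   # bits for the feature mask (illustrative size)
PARAM_BITS = 8    # bits each for discretized C and gamma
CHROM_LEN = N_FEATURES + 2 * PARAM_BITS

def decode(chrom):
    """Split a chromosome into (feature_mask, C, gamma)."""
    def bits_to_float(bits, lo, hi):
        v = int("".join(map(str, bits)), 2)
        return lo + (hi - lo) * v / (2 ** len(bits) - 1)
    mask = chrom[:N_FEATURES]
    c = bits_to_float(chrom[N_FEATURES:N_FEATURES + PARAM_BITS], 0.1, 100.0)
    gamma = bits_to_float(chrom[N_FEATURES + PARAM_BITS:], 1e-4, 1.0)
    return mask, c, gamma

def fitness(chrom):
    """Toy stand-in: reward chromosomes that keep few features
    (but at least one). A real implementation would train an SVM
    with (mask, C, gamma) and combine bootstrap accuracy, feature
    count, and support-vector count, as the paper proposes."""
    mask, _, _ = decode(chrom)
    n_sel = sum(mask)
    return 1.0 / n_sel if n_sel else 0.0

def evolve(pop_size=20, generations=30, p_mut=0.02):
    """Plain serial GA: truncation selection, one-point crossover,
    bit-flip mutation. A CGPGA would run several such populations
    in parallel and migrate individuals between them."""
    pop = [[random.randint(0, 1) for _ in range(CHROM_LEN)]
           for _ in range(pop_size)]
    for _ in range(generations):
        parents = sorted(pop, key=fitness, reverse=True)[:pop_size // 2]
        children = []
        while len(children) < pop_size - len(parents):
            a, b = random.sample(parents, 2)
            cut = random.randrange(1, CHROM_LEN)       # one-point crossover
            child = [1 - g if random.random() < p_mut else g
                     for g in a[:cut] + b[cut:]]       # bit-flip mutation
            children.append(child)
        pop = parents + children
    return max(pop, key=fitness)

best = evolve()
mask, c, gamma = decode(best)
print(sum(mask), round(c, 2), round(gamma, 4))
```

Because every fitness evaluation in the real setting means training an SVM, the population-level parallelism of a CGPGA directly attacks the computational bottleneck noted above.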

Support Vector Machines
Parallel Genetic Algorithms
Method
Experiments
Limitations and Conclusions