Gene expression profiles are sequences of numbers, and the need to analyze them has now increased significantly. Gene expression data contain a large number of genes and models used for cancer classification. As the wealth of these data being produced, new prediction, classification and clustering techniques are applied to the analysis of the data. Although there are a number of proposed methods with good results, there is still limited diagnostics and a lot of problems still to be solved. To solve the difficulty, in this paper, an efficient gene expression data classification is proposed. To predict the cancer class of patients from the gene expression profile, this paper presents a novel classification framework in the manner of three steps namely, Pre-processing, feature selection and classification. In pre-processing, missing value is filled and redundant data are removed. To attain the enhanced classification outcomes, the important features are selected from the database with the help of Adaptive Salp Swarm Optimization (ASSO) algorithm. Then, the selected features are given to the multi kernel SVM (MKSVM) to classify the gene expression data namely, BRCA, KIRC, COAD, LUAD and PRAD. The performance of proposed methodology is analyzed in terms of different metrics namely, accuracy, sensitivity and specificity. The performance of proposed methodology is 4.5% better than existing method in terms of accuracy.
Read full abstract