Genetic Clustering Algorithm-Based Feature Selection and Divergent Random Forest for Multiclass Cancer Classification Using Gene Expression Data

L Senbagamalar,S Logeswari

doi:10.1007/s44196-024-00416-9

Abstract

AbstractComputational identification and classification of clinical disorders gather major importance due to the effective improvement of machine learning methodologies. Cancer identification and classification are essential clinical areas to address, where accurate classification for multiple types of cancer is still in a progressive stage. In this article, we propose a multiclass cancer classification model that categorizes the five different types of cancers using gene expression data. To perform efficient analysis of the available clinical data, we propose feature selection and classification methods. We propose a genetic clustering algorithm (GCA) for optimal feature selection from the RNA-gene expression data, consisting of 801 samples belonging to the five major classes of cancer. The proposed feature selection method reduces the 1621 gene expressions into a cluster of 21 features. The optimum feature set acts as input data to the proposed divergent random forest. Based on the features computed, the proposed classifier categorizes the data samples into 5 different classes of cancers, including breast cancer, colon cancer, kidney cancer, lung cancer, and prostate cancer. The proposed divergent random forest provided performance improvisation in terms of accuracy with 95.21%, specificity with 93%, and sensitivity with 94.29% which outperformed all the other existing multiclass classification algorithms.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal of Computational Intelligence Systems	Publication Date: Feb 5, 2024
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Genetic Clustering Algorithm-Based Feature Selection and Divergent Random Forest for Multiclass Cancer Classification Using Gene Expression Data

Abstract

Talk to us

Similar Papers

More From: International Journal of Computational Intelligence Systems

Lead the way for us

Similar Papers

Informative Feature Clustering and Selection for Gene Expression Data
Yuqi Yang ... Zhihang Luo
IEEE Access | VOL. 7
Yuqi Yang, et. al.Yuqi Yang ... Zhihang Luo
01 Jan 2019
IEEE Access | VOL. 7

Variance-based Feature Selection for Classification of Cancer Subtypes Using Gene Expression Data
Aedan G K Roberts ... Daniel R Catchpoole
-
Aedan G K Roberts, et. al.Aedan G K Roberts ... Daniel R Catchpoole
01 Jul 2018
01 Jul 2018

Metaheuristic Search Based Feature Selection Methods for Classification of Cancer
L Meenachi ... S Ramakrishnan
Pattern Recognition | VOL. 119
L Meenachi, et. al.L Meenachi ... S Ramakrishnan
22 Jun 2021
Pattern Recognition | VOL. 119

Comparison of feature selection methods for multiclass cancer classification based on microarray data
Xiaobo Li ... Yueming Xu
-
Xiaobo Li, et. al.Xiaobo Li ... Yueming Xu
01 Oct 2011
01 Oct 2011

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Genetic Clustering Algorithm-Based Feature Selection and Divergent Random Forest for Multiclass Cancer Classification Using Gene Expression Data

Abstract

Talk to us

Similar Papers

More From: International Journal of Computational Intelligence Systems