Improved Regularized Multi-class Logistic Regression for Gene Classification with Optimal Kernel PCA and HC Algorithm.

Nwayyin Najat Mohammed

doi:10.1007/978-3-031-31982-2_31

Abstract

A significant challenge in high-dimensional and big data analysis is the classification and prediction of the variables of interest. Massive genetic datasets are complex. Gene expression datasets are enriched with useful genes that are associated with specific diseases such as cancer. In this study, we used two gene expression datasets from the Gene Expression Omnibus and preprocessed them before classification. We used optimal kernel principal component analysis, in which the optimal kernel function was chosen, to reduce the dimensionality of the datasets and extract the most important features. Gene sets with a high validity index were collected using a combined hierarchical clustering and optimal kernel principal component analysis (KHC-RLR) algorithm. Logistic regression is one of the most common classification methods and has been shown to be useful for gene expression data analysis. In this study, we used multi-class logistic regression to classify the collected gene sets. We found that ordinary logistic regression caused a major overfitting problem; therefore, we used regularized multi-class logistic regression to classify the gene sets. The proposed KHC-RLR algorithm showed high performance and satisfactory accuracy measures.
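The three stages named in the abstract (kernel PCA, hierarchical clustering, regularized multi-class logistic regression) can be chained as in the minimal sketch below. It is an illustration only, not the authors' implementation. It assumes scikit-learn, synthetic data in place of the Gene Expression Omnibus datasets, a fixed RBF kernel instead of the optimal kernel search, Ward linkage, three clusters, and cluster labels used as the class targets. All of these choices and parameter values are hypothetical.

```python
import numpy as np
from sklearn.cluster import AgglomerativeClustering
from sklearn.datasets import make_blobs
from sklearn.decomposition import KernelPCA
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for an expression matrix: 300 rows ("genes") by 50
# columns ("samples"). The real study used preprocessed GEO datasets.
X, _ = make_blobs(n_samples=300, n_features=50, centers=3, random_state=0)
X = StandardScaler().fit_transform(X)

# Step 1: kernel PCA for dimensionality reduction. The RBF kernel and gamma
# are assumptions; the paper selects an optimal kernel function instead.
kpca = KernelPCA(n_components=5, kernel="rbf", gamma=1.0 / X.shape[1])
Z = kpca.fit_transform(X)

# Step 2: hierarchical (agglomerative) clustering on the reduced features.
# Ward linkage and n_clusters=3 are assumed values.
labels = AgglomerativeClustering(n_clusters=3, linkage="ward").fit_predict(Z)

# Step 3: regularized multi-class logistic regression that predicts the
# cluster label of each row. scikit-learn applies an L2 penalty by default;
# a smaller C means stronger regularization.
X_train, X_test, y_train, y_test = train_test_split(
    X, labels, test_size=0.3, random_state=0, stratify=labels
)
clf = LogisticRegression(C=0.1, max_iter=1000)
clf.fit(X_train, y_train)
accuracy = clf.score(X_test, y_test)
print(f"held-out accuracy: {accuracy:.3f}")
```

Raising `C` to a very large value approximates unregularized logistic regression, which gives a simple way to compare training and held-out accuracy for the overfitting behaviour the abstract mentions.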


More From: Advances in experimental medicine and biology


