A Combined Enhancing and Feature Extraction Algorithm to Improve Learning Accuracy for Gene Expression Classification

Phuoc-Hai Huynh,Van-Hoa Nguyen,Thanh-Nghi Do

doi:10.1007/978-3-030-35653-8_17

Abstract

In recent years, gene expression data combined with machine learning methods revolutionized cancer classification which had been based solely on morphological appearance. However, the characteristics of gene expression data have very-high-dimensional and small-sample-size which lead to over-fitting of classification algorithms. We propose a novel gene expression classification model of multiple classifying algorithms with synthetic minority oversampling technique (SMOTE) using features extracted by deep convolutional neural network (DCNN). In our approach, the DCNN extracts latent features of gene expression data, then the SMOTE algorithm generates new data from the features of DCNN was implemented. These models are used in conjunction with classifiers that efficiently classify gene expression data. Numerical test results on fifty very-high-dimensional and small-sample-size gene expression datasets from the Kent Ridge Biomedical and Array Expression repositories illustrate that the proposed algorithm is more accurate than state-of-the-art classifying models and improve the accuracy of classifiers including non-linear support vector machines (SVM), linear SVM, k nearest neighbors and random forests.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Combined Enhancing and Feature Extraction Algorithm to Improve Learning Accuracy for Gene Expression Classification

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Novel hybrid DCNN–SVM model for classifying RNA-sequencing gene expression data
Phuoc-Hai Huynh ... Thanh-Nghi Do
Journal of Information and Telecommunication | VOL. 3
Phuoc-Hai Huynh, et. al.Phuoc-Hai Huynh ... Thanh-Nghi Do
05 Sep 2019
Journal of Information and Telecommunication | VOL. 3

Aircraft Classification Based on PCA and Feature Fusion Techniques in Convolutional Neural Network
Faisal Azam ... Heejung Yu
IEEE Access | VOL. 9
Faisal Azam, et. al.Faisal Azam ... Heejung Yu
01 Jan 2020
IEEE Access | VOL. 9

Random ensemble oblique decision stumps for classifying gene expression data
Phuoc-Hai Huynh ... Thanh-Nghi Do
-
Phuoc-Hai Huynh, et. al.Phuoc-Hai Huynh ... Thanh-Nghi Do
01 Jan 2018
01 Jan 2018

A Coupling Support Vector Machines with the Feature Learning of Deep Convolutional Neural Networks for Classifying Microarray Gene Expression Data
Phuoc-Hai Huynh ... Van-Hoa Nguyen
-
Phuoc-Hai Huynh, et. al.Phuoc-Hai Huynh ... Van-Hoa Nguyen
01 Jan 2018
01 Jan 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Combined Enhancing and Feature Extraction Algorithm to Improve Learning Accuracy for Gene Expression Classification

Abstract

Talk to us

Similar Papers