Towards the classification of cancer subtypes by using cascade deep forest model in gene expression data

Yang Guo, Shuhui Liu, Xuequn Shang, Zhanhuai Li

doi:10.1109/bibm.2017.8217909

Abstract

The classification of cancer subtypes is of great importance in cancer diagnosis and therapy. Many supervised learning methods have been applied to cancer subtype classification in the past few years, especially deep learning based methods. Recently, a deep forest model has been proposed as an alternative to deep neural networks; it learns hyper-representations by using cascaded ensembles of decision trees, and it has been shown to achieve performance competitive with, or even better than, deep neural networks. However, the original deep forest may face under-fitting and ensemble diversity problems when dealing with small-sample, high-dimensional biological data, so it is important to improve the model to work better on small-scale biological data. In this paper, we propose a deep learning model for cancer subtype classification on small-scale biological data sets, which can be viewed as a modification of the original deep forest model. Our model differs from the original deep forest model in two main ways. First, a method named multi-class-scanning is proposed to train multiple simple binary classifiers to encourage ensemble diversity; meanwhile, the fitting quality of each classifier is taken into account in representation learning. Second, we propose a boosting strategy that emphasizes the more important features in the cascade forests during representation learning, thereby propagating the benefits of discriminative features across layers to improve overall classification performance. Systematic experiments on both microarray and RNA-seq data sets demonstrate that our method consistently outperforms state-of-the-art classification methods for cancer subtype classification.
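
The boosting strategy described in the abstract can be illustrated with a small sketch. The abstract does not say how important features are selected or passed between layers, so everything below is an assumption made for illustration: a hypothetical top-k selection by per-forest feature importance, and a next-layer input formed by concatenating the original features, the forest's class-probability vector, and the selected features. The function names `top_k_indices` and `next_layer_input` are invented here and are not taken from the paper.

```python
from typing import List, Sequence


def top_k_indices(importances: Sequence[float], k: int) -> List[int]:
    """Return the indices of the k largest importances, in ascending index order.

    Ties are broken in favour of the lower index (an arbitrary choice).
    """
    order = sorted(range(len(importances)), key=lambda i: (-importances[i], i))
    return sorted(order[:k])


def next_layer_input(
    sample: Sequence[float],
    class_probs: Sequence[float],
    importances: Sequence[float],
    k: int,
) -> List[float]:
    """Build one sample's input vector for the next cascade layer.

    Assumed layout (not confirmed by the abstract):
    original features + class-probability vector + top-k important features.
    """
    selected = [sample[i] for i in top_k_indices(importances, k)]
    return list(sample) + list(class_probs) + selected


# Toy example: 4 gene-expression features, 2 cancer subtypes.
sample = [0.1, 0.9, 0.4, 0.7]
class_probs = [0.2, 0.8]               # output of one forest in the current layer
importances = [0.05, 0.6, 0.05, 0.3]   # that forest's feature importances
boosted = next_layer_input(sample, class_probs, importances, k=2)
```

In this toy case the features at indices 1 and 3 are repeated in the next layer's input, which is one simple way discriminative features could be given extra weight as the cascade deepens.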

Similar Papers

BCDForest: a boosting cascade deep forest model towards the classification of cancer subtypes based on gene expression data
Yang Guo ... Shuhui Liu
BMC Bioinformatics | VOL. 19
01 Apr 2018

A laminar augmented cascading flexible neural forest model for classification of cancer subtypes based on gene expression data
Lianxin Zhong ... Peng Wu
BMC Bioinformatics | VOL. 22
02 Oct 2021

A Novel Deep Flexible Neural Forest Model for Classification of Cancer Subtypes Based on Gene Expression Data
Jing Xu ... Hussain Dawood
IEEE Access | VOL. 7
01 Jan 2019

Fracture identification of carbonate reservoirs by deep forest model: An example from the D oilfield in Zagros Basin
Chunqiu Ji ... Ziyi Yang
Energy Geoscience | VOL. 5
27 Mar 2024
