CancerSiamese: one-shot learning for predicting primary and metastatic tumor types unseen during model training

Milad Mostavi,Yufei Huang,Yidong Chen,Yu-Chiao Chiu

doi:10.1186/s12859-021-04157-w

Abstract

BackgroundThe state-of-the-art deep learning based cancer type prediction can only predict cancer types whose samples are available during the training where the sample size is commonly large. In this paper, we consider how to utilize the existing training samples to predict cancer types unseen during the training. We hypothesize the existence of a set of type-agnostic expression representations that define the similarity/dissimilarity between samples of the same/different types and propose a novel one-shot learning model called CancerSiamese to learn this common representation. CancerSiamese accepts a pair of query and support samples (gene expression profiles) and learns the representation of similar or dissimilar cancer types through two parallel convolutional neural networks joined by a similarity function.ResultsWe trained CancerSiamese for cancer type prediction for primary and metastatic tumors using samples from the Cancer Genome Atlas (TCGA) and MET500. Network transfer learning was utilized to facilitate the training of the CancerSiamese models. CancerSiamese was tested for different N-way predictions and yielded an average accuracy improvement of 8% and 4% over the benchmark 1-Nearest Neighbor (1-NN) classifier for primary and metastatic tumors, respectively. Moreover, we applied the guided gradient saliency map and feature selection to CancerSiamese to examine 100 and 200 top marker-gene candidates for the prediction of primary and metastatic cancers, respectively. Functional analysis of these marker genes revealed several cancer related functions between primary and metastatic tumors.ConclusionThis work demonstrated, for the first time, the feasibility of predicting unseen cancer types whose samples are limited. Thus, it could inspire new and ingenious applications of one-shot and few-shot learning solutions for improving cancer diagnosis, prognostic, and our understanding of cancer.

Highlights

The state-of-the-art deep learning based cancer type prediction can only predict cancer types whose samples are available during the training where the sample size is commonly large
We developed the CancerSiamese, an Siamese convolutional neural networks (SCNNs) model that contains two identical 1D-convolutional neural networks (CNNs), which learn cancer type representations of query and support samples, followed by a metric-learning layer to predict if the representations from the query and support sample are similar or not
CancerSiamese networks were trained on the Cancer Genome Atlas (TCGA) and MET500 metastatic cancer cohort (MET500) training datasets separately with Keras deep learning (DL) platform with the Tensorflow backend [24]

Summary

Introduction

The state-of-the-art deep learning based cancer type prediction can only predict cancer types whose samples are available during the training where the sample size is commonly large. We hypothesize the existence of a set of type-agnostic expression representations that define the simi‐ larity/dissimilarity between samples of the same/different types and propose a novel one-shot learning model called CancerSiamese to learn this common representation. It becomes increasingly clear that as much as molecular profiles can accurately predict current cancer types, the spectrum of cancer transcends existing tumor lineages, underscoring the need for a molecular-based classification of individual tumors. This emergent perspective of cancer fosters a more effective “precision cancer therapy," which advocates specialized diagnosis and treatments based on individual patients’ molecular makeup [5]

Objectives

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Bioinformatics	Publication Date: May 12, 2021
Citations: 12	License type: open-access

R Discovery Prime

R Discovery Prime

CancerSiamese: one-shot learning for predicting primary and metastatic tumor types unseen during model training

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

Development of a 4-miRNA prognostic signature for endometrial cancer.
Jiazhen Huang ... Furong Du
Medicine | VOL. 101
Jiazhen Huang, et. al.Jiazhen Huang ... Furong Du
14 Oct 2022
Medicine | VOL. 101

Pan-cancer analysis of non-coding transcripts reveals the prognostic onco-lncRNA HOXA10-AS in gliomas.
Keren Isaev ... Ricky Tsai
Cell Reports | VOL. 37
Keren Isaev, et. al.Keren Isaev ... Ricky Tsai
01 Oct 2021
Cell Reports | VOL. 37

Abstract 758: Differential gene expression profiling of matched primary and metastatic triple negative breast cancer
Jaspreet Kaur
Cancer Research | VOL. 82
Jaspreet KaurJaspreet Kaur
15 Jun 2022
Cancer Research | VOL. 82

Abstract 743: Fusion gene identification from common cancer cell lines and comparison to primary tumors
Neetha Nanoth Vellichirammal ... Babu Guda
Cancer Research | VOL. 79
Neetha Nanoth Vellichirammal, et. al.Neetha Nanoth Vellichirammal ... Babu Guda
01 Jul 2019
Cancer Research | VOL. 79

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

CancerSiamese: one-shot learning for predicting primary and metastatic tumor types unseen during model training

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics