Abstract

BackgroundAccurate cancer classification is essential for correct treatment selection and better prognostication. microRNAs (miRNAs) are small RNA molecules that negatively regulate gene expression, and their dyresgulation is a common disease mechanism in many cancers. Through a clearer understanding of miRNA dysregulation in cancer, improved mechanistic knowledge and better treatments can be sought.ResultsWe present a topology-preserving deep learning framework to study miRNA dysregulation in cancer. Our study comprises miRNA expression profiles from 3685 cancer and non-cancer tissue samples and hierarchical annotations on organ and neoplasticity status. Using unsupervised learning, a two-dimensional topological map is trained to cluster similar tissue samples. Labelled samples are used after training to identify clustering accuracy in terms of tissue-of-origin and neoplasticity status. In addition, an approach using activation gradients is developed to determine the attention of the networks to miRNAs that drive the clustering. Using this deep learning framework, we classify the neoplasticity status of held-out test samples with an accuracy of 91.07%, the tissue-of-origin with 86.36%, and combined neoplasticity status and tissue-of-origin with an accuracy of 84.28%. The topological maps display the ability of miRNAs to recognize tissue types and neoplasticity status. Importantly, when our approach identifies samples that do not cluster well with their respective classes, activation gradients provide further insight in cancer subtypes or grades.ConclusionsAn unsupervised deep learning approach is developed for cancer classification and interpretation. This work provides an intuitive approach for understanding molecular properties of cancer and has significant potential for cancer classification and treatment selection.

Highlights

  • Accurate cancer classification is essential for correct treatment selection and better prognostication. microRNAs are small RNA molecules that negatively regulate gene expression, and their dyresgulation is a common disease mechanism in many cancers

  • MiRNA expression profiles from 2524 samples are included in subsequent analyses below; sample ontology is depicted in Fig. 1

  • We only study a subset of highly activated miRNAs and limit them to those contributing to a total of 75% of activation gradients in their corresponding class

Read more

Summary

Introduction

Accurate cancer classification is essential for correct treatment selection and better prognostication. microRNAs (miRNAs) are small RNA molecules that negatively regulate gene expression, and their dyresgulation is a common disease mechanism in many cancers. In humans and many other species, miRNAs negatively regulate gene expression through molecular interactions between the miRNA seed sequence (nucleotide positions 2–8) and complementary sites in target genes [8] These molecules are excellent biomarkers due to their tractable number (∼ 1200 are encoded within the human genome), abundance, cell-type and disease-stage specificity, and stability in tissue and blood samples [5, 9, 10]. Certain miRNAs have been shown to act as oncomiRs or tumour suppressors based on the function of their target gene [11,12,13] These molecules are, highly valuable biomarkers for cancer research. Similar to other -omics data, miRNA datasets suffer from the curse of dimensionality (i.e. number of variables exceed that of observations), and there is a pressing need for building computational approaches that accurately derive clinically useful information from large data

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.