Abstract

Disease classification based on gene information has been of significance as the foundation for achieving precision medicine. Previous works focus on classifying diseases according to the gene expression data of patient samples, and constructing disease network based on the overlap of disease genes, as many genes have been confirmed to be associated with diseases. In this work, the effects of diseases on human biological functions are assessed from the perspective of gene network modules and pathways, and the distances between diseases are defined to carry out the classification models. In total, 1728 diseases are divided into 12 and 14 categories by the intensity and scope of effects on pathways, respectively. Each category is a mix of several types of diseases identified based on congenital and acquired factors as well as diseased tissues and organs. The disease classification models on the basis of gene network are parallel with traditional pathology classification based on anatomic and clinical manifestations, and enable us to look at diseases in the viewpoint of commonalities in etiology and pathology. Our models provide a foundation for exploring combination therapy of diseases, which in turn may inform strategies for future gene-targeted therapy.

Highlights

  • Characterizing disease in the biological big data era of the twentyfirst century has been of significance [1], based on pathological analysis and clinical syndromes and molecularlevel information, including gene data

  • Each category is a mix of several types of diseases identified based on congenital and acquired factors as well as diseased tissues and organs, which implies that the human gene network gives a perspective of disease classifications, and guides future gene-targeted therapy and combination therapy of diseases

  • As each module has a different number of genes, we apply the adjusted cosine similarity to measure the correlations of pathway genes distribution in network modules

Read more

Summary

Introduction

Characterizing disease in the biological big data era of the twentyfirst century has been of significance [1], based on pathological analysis and clinical syndromes and molecularlevel information, including gene data. Many network-based approaches are proposed in order to explore the associations between human genetic diseases and the relations between their effector genes [22,23,24,25]. Lee et al [24] construct a bipartite human disease association network where the disease pairs are connected if mutated enzymes associated with them catalyse adjacent metabolic reactions This network topology-based approach helps to uncover potential mechanisms that contribute to their shared pathophysiology [24]. Li et al take a comprehensive perspective on oncogenic processes based on Pan-Cancer Atlas analyses, giving prominence to the complex impact of genome alterations on the signalling and multi-omic profiles of human cancers as well as their influence on tumour microenvironment [9]. Each category is a mix of several types of diseases identified based on congenital and acquired factors as well as diseased tissues and organs, which implies that the human gene network gives a perspective of disease classifications, and guides future gene-targeted therapy and combination therapy of diseases

Data collection
Module partition based on the fast unfolding algorithm
The influence on pathways by diseases
Results
Genes and interactions: underlying framework of the human gene network
Module partition: bridge between individual genes and pathways
Disease classification: in terms of the influence on pathways
Classification by the intensity of effects on pathways
Classification by the scope of effects on pathways
Associations and differences between the two classifications
44. Hu S et al 2009 Profiling the human protein- 22

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.