Abstract

Mechanisms underlying human diseases have been revealed with the development of molecular biology. The underlying molecular basis of disorders is valuable in prevention, diagnosis, and treatment. Decade-long efforts have been devoted to investigating disease–gene association through positional cloning of disease genes and genome-wide association studies. In particular, correlations among different diseases have been discovered by many clinical cases. The shared disease-associated genes may help reveal the intrinsic relationship in the genetic level, provide an access to evaluate disease similarity, and establish a human disease network. Although many methods have been proposed to measure disease similarity, they only consider the genes or functions directly annotated to diseases but ignore the interactions among genes or functions. These interactions cause deficiency in disease classification. Basing on network-based disease module, we presented a systematic research to further investigate the relationship among different human diseases and explore whether this correlation depends on the functions of corresponding disease genes. On the one hand, a disease clustering based on the separation score between diseases is applied to divide 299 diseases into 15 relatively separated disease clusters. On the other hand, an optimal clustering scheme discriminating 15 disease clusters was learned based on disease-associated genes, their GO terms, and KEGG pathways annotations. The detected key signatures showed the highest relevance to distinguishing distinct disease clusters and represented the essential functions in corresponding pathogenesis. This study provides a novel approach to predict the network and function characteristics and reveals the functional essence of diseases.

Highlights

  • By the early 1900s, the Mendelian law of inheritance described the pattern that some traits can be transmitted from one generation to another

  • An iterative algorithm depending on the association data of 299 diseases was employed to cluster them into 15 disease clusters (Supplementary file 1)

  • The following two kinds of features were first produced on the basis of network function: (1) 500 features extracted by node2vec from STRING protein-protein interaction (PPI) network and (2) 297 KEGG features and 20681 Gene Ontology (GO) features extracted by enrichment on disease-associated genes

Read more

Summary

Introduction

By the early 1900s, the Mendelian law of inheritance described the pattern that some traits can be transmitted from one generation to another. This pattern contributes to building the linkage between phenotypes and genes. Gene mutations showed strong relevance to human diseases, for genetically inherited disorders. With the development of molecular biology, mechanisms underlying human diseases have been revealed. Biological processes caused by gene dysfunction result in various diseases [1], [2]. Uncovering the underlying molecular basis is valuable in the prevention, diagnosis, and treatment of diseases.

Methods
Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.