Abstract

Gene co-expression network analysis is an effective method for predicting gene functions and disease biomarkers. However, few studies have systematically identified co-expressed genes involved in the molecular origin and development of various types of tumors. In this study, we used a network mining algorithm to identify tightly connected gene co-expression networks that are frequently present in microarray datasets from 33 types of cancer which were derived from 16 organs/tissues. We compared the results with networks found in multiple normal tissue types and discovered 18 tightly connected frequent networks in cancers, with highly enriched functions on cancer-related activities. Most networks identified also formed physically interacting networks. In contrast, only 6 networks were found in normal tissues, which were highly enriched for housekeeping functions. The largest cancer network contained many genes with genome stability maintenance functions. We tested 13 selected genes from this network for their involvement in genome maintenance using two cell-based assays. Among them, 10 were shown to be involved in either homology-directed DNA repair or centrosome duplication control including the well- known cancer marker MKI67. Our results suggest that the commonly recognized characteristics of cancers are supported by highly coordinated transcriptomic activities. This study also demonstrated that the co-expression network directed approach provides a powerful tool for understanding cancer physiology, predicting new gene functions, as well as providing new target candidates for cancer therapeutics.

Highlights

  • Distinct types of human cancer share similar traits, including rapid cell proliferation, loss of cell identity, and the ability to migrate and seed malignant tumors in distal locations

  • The datasets were selected such that the sample size in each dataset is above a minimal threshold to maintain the significance level of Pearson Correlation Coefficient (PCC) computation (p-values,0.05 for PCC values larger than the threshold as described in Materials and Methods)

  • The frequencies of highly correlated gene pairs were used as weights to build a weighted gene co-expression frequency network (WGCFN)

Read more

Summary

Introduction

Distinct types of human cancer share similar traits, including rapid cell proliferation, loss of cell identity, and the ability to migrate and seed malignant tumors in distal locations. Understanding these common traits and identifying the underlying genes/networks are key to gaining insight into cancer physiology, and, to prevent and cure cancer. In order to overcome this hurdle to identify functionally related genes associated with disease development and prognosis, several approaches have been adopted One such approach is gene coexpression analysis, which identifies groups of genes that are highly correlated in expression levels across multiple samples [4,5,6,7,8,9]. We were able to identify new gene functions in regulating cell mitosis in breast cancer [5,11] by studying genes that have high correlation with the expression of the DNA repair protein, BRCA1

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call